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GENERAL   INTRODUCTION 

American    Chemical    Society    Series    of 
Scientific   and   Technologic   Monographs 

By  arrangement  with  the  Interallied  Conference  of  Pure  and 
Applied  Chemistry,  which  met  in  London  and  Brussels  in  July, 
1919,  the  American  Chemical  Society  was  to  undertake  the  pro- 
duction and  publication  of  Scientific  and  Technologic  Mono- 
graphs on  chemical  subjects.  At  the  same  time  it  was  agreed 
that  the  National  Research  Council,  in  cooperation  with  the 
American  Chemical  Society  and  the  American  Physical  Society, 
should  undertake  the  production  and  publication  of  Critical 
Tables  of  Chemical  and  Physical  Constants.  The  American 
Chemical  Society  and  the  National  Research  Council  mutually 
agreed  to  care  for  these  two  fields  of  chemical  development. 
The  American  Chemical  Society  named  as  Trustees,  to  make 
the  necessary  arrangements  for  the  publication  of  the  mono- 
graphs, Charles  L.  Parsons,  Secretary  of  the  American  Chemical 
Society,  Washington,  D.  C;  John  E.  Teeple,  Treasurer  of  the 
American  Chemical  Society,  New  York  City;  and  Professor 
Gellert  Alleman  of  Swarthmore  College.  The  Trustees  have 
arranged  for  the  publication  of  the  American  Chemical  Society 
series  of  (a)  Scientific  and  (b)  Technologic  Monographs  by  the 
Chemical  Catalog  Company  of  New  York  City. 

The  Council,  acting  through  the  Committee  on  National  Policy 
of  the  American  Chemical  Society,  appointed  the  editors,  named 
at  the  close  of  this  introduction,  to  have  charge  of  securing 
authors,  and  of  considering  critically  the  manuscripts  prepared. 
The  editors  of  each  series  will  endeavor  to  select  topics  which 
are  of  current  interest  and  authors  who  are  recognized  as  author- 
ities in  their  respective  fields.  The  list  of  monographs  thus  far 
secured  appears  in  the  publisher's  own  announcement  elsewhere 
in  this  volume. 
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4  GENERAL  INTRODUCTION 

The  development  of  knowledge  in  all  branches  of  science,  and 
especially  in  chemistry,  has  been  so  rapid  during  the  last  fifty 
years  and  the  fields  covered  by  this  development  have  been  so 
varied  that  it  is  difficult  for  any  individual  to  keep  in  touch  with 
the  progress  in  branches  of  science  outside  his  own  specialty. 
In  spite  of  the  facilities  for  the  examination  of  the  literature 
given  by  Chemical  Abstracts  and  such  compendia  as  Beilstein's 
Handbuch  der  Organischen  Chemie,  Richter's  Lexikon,  Ostwald's 
Lehrbuch  der  Allgemeinen  Chemie,  Abegg's  and  Gmelin-Kraut's 
Handbuch  der  Anorganischen  Chemie  and  the  English  and 
French  Dictionaries  of  Chemistry,  it  often  takes  a  great  deal 
of  time  to  coordinate  the  knowledge  available  upon  a  single  topic. 
Consequently  when  men  who  have  spent  years  in  the  study  of 
important  subjects  are  willing  to  coordinate  their  knowledge 
and  present  it  in  concise,  readable  form,  they  perform  a  service 
of  the  highest  value  to  their  fellow  chemists. 

It  was  with  a  clear  recognition  of  the  usefulness  of  reviews  of 
this  character  that  a  Committee  of  the  American  Chemical 
Society  recommended  the  publication  of  the  two  series  of  mono- 
graphs under  the  auspices  of  the  Society. 

Two  rather  distinct  purposes  are  to  be  served  by  these  mono- 
graphs. The  first  purpose,  whose  fulfilment  will  probably  render 
to  chemists  in  general  the  most  important  service,  is  to  present 
the  knowledge  available  upon  the  chosen  topic  in  a  readable 
form,  intelligible  to  those  whose  activities  may  be  along  a  wholly 
different  line.  Many  chemists  fail  to  realize  how  closely  their 
investigations  may  be  connected  with  other  work  which  on  the 
surface  appears  far  afield  from  their  own.  These  monographs 
will  enable  such  men  to  form  closer  contact  with  the  work  of 
chemists  in  other  lines  of  research.  The  second  purpose  is  to 
promote  research  in  the  branch  of  science  covered  by  the  mono- 
graph, by  furnishing  a  well  digested  survey  of  the  progress 
already  made  in  that  field  and  by  pointing  out  directions  in 
which  investigation  needs  to  be  extended.  To  facilitate  the 
attainment  of  this  purpose,  it  is  intended  to  include  extended 
references  to  the  literature,  which  will  enable  anyone  interested 
to  follow  up  the  subject  in  more  detail.  If  the  literature  is  so 
voluminous  that  a  complete  bibliography  is  impracticable,  a 
critical  selection  will  be  made  of  those  papers  which  are  most 
important. 
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The  publication  of  these  books  marks  a  distinct  departure  in 
the  policy  of  the  American  Chemical  Society  inasmuch  as  it  is 
a  serious  attempt  to  found  an  American  chemical  literature  with- 
out primary  regard  to  commercial  considerations.  The  success 
of  the  venture  will  depend  in  large  part  upon  the  measure  of 
cooperation  which  can  be  secured  in  the  preparation  of  books 
dealing  adequately  with  topics  of  general  interest;  it  is  earnestly 
hoped,  therefore,  that  every  member  of  the  various  organizations 
in  the  chemical  and  allied  industries  will  recognize  the  impor- 
tance of  the  enterprise  and  take  sufficient  interest  to  justify  it. 
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Preface 

It  is  the  purpose  of  the  following  book  to  give  in  a  single  place  a 
systematic  presentation  of  the  theory  of  statistical  mechanics,  together 
with  a  survey  of  most  of  its  fields  of  application  in  physics  and  chemistry. 
It  is  hoped  that  the  result  will  be  of  assistance  to  physicists  and  chemists, 
especially  to  the  latter  since  they  have  to  deal  for  the  most  part  with 
systems  so  complicated  that  statistical  methods  are  specially  advantageous. 

Certain  features  of  the  book  may  be  mentioned  in  advance. 

The  attempt  has  been  made  to  develop  the  theory  of  statistical 
mechanics  in  a  logical  fashion,  giving  special  attention  to  the  physical 
implication  of  the  concepts  employed  rather  than  to  mathematic  subtilities. 
It  is  hoped  that  the  frankness  with  which  the  ergodic  hypothesis  is  defi- 
nitely introduced  as  a  postulate  will  meet  with  approval,  and  that  the 
particular  form  of  this  postulate,  which  has  appealed  most  to  the  writer, 
will  also  seem  plausible  to  others. 

Chapters  have  been  included  which  present  the  elements  of  classical 
and  quantum  mechanics  in  sufficient  measure  to  serve  as  a  foundation  for 
statistical  mechanics.  The  treatment  of  quantum  mechanics  is  of  course 
only  partially  satisfactory  owing  to  the  rapidity  of  development  in  this 
field  of  investigation. 

Chapter  15,  on  general  principles  regarding  molecular  states  and 
molecular  processes,  contains  a  new  treatment  of  the  principle  of  micro- 
scopic reversibility  which  may  appear  complicated  but  will  perhaps  assist 
in  removing  earlier  illogicalities  in  the  treatment  of  this  principle. 

Other  original  contributions  in  the  book  include  the  deduction  of  a 
general  equipartition  law  which  holds  in  cases  where  the  principle  of 
equipartition  of  energy  breaks  down,  an  extension  of  Boltzmann's  H- 
theorem  to  include  radiation,  and  a  discussion  of  the  possibility  of  intro- 
ducing the  quantrum  theory  by  changing  the  dependence  of  energy  on  the 
coordinates  and  momenta  instead  of  by  concentrating  ^i-weight  on  the 
allowed  quantrum  states.  It  is  hoped  that  these  tentative  additions  to  the 
theory  may  prove  useful. 

Special  attention  has  been  given  to  the  application  of  statistical 
mechanics  to  rates  of  physical-chemical  change,  since  this  is  one  of  the 
fields  where  statistical  mechanics  most  clearly  demonstrates  its  superiority 
to  thermodynamics.  It  is  thought  that  the  treatment  in  a  single  place 
of  transport  problems  both  for  gases  at  moderate  concentrations  and  at 
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10  PREFACE 

high  dilutions  will  prove  useful.  Apology  should  possibly  be  offered  for 
presenting  a  treatment  of  chemical  reaction  rate  which  gives  perhaps  too 
much  attention  to  the  author's  own  interests  and  point  of  view  in  that 
field.     The  fault  is,  however,  a  natural  one. 

In  the  preparation  of  the  book,  the  author  has  often  had  to  make  use 
of  Boltzmann's  "Gas  Theorie",  Gibbs'  "Statistical  Mechanics",  Jeans' 
"Dynamical  Theory  of  Gases"  and  the  treatment  of  P.  and  T.  Eherenfest 
"Mechanik  der  aus  sehr  zahlreichen  diskreten  Teilen  bestehende  Systeme" 
and  specific  reference  to  these  works  has  for  the  most  part  been  omitted. 
References  to  newer  literature  have,  however,  often  been  inserted  with 
the  hope  that  they  will  prove  useful. 

Professor  Roscoe  G.  Dickinson  has  been  good  enough  to  read  all  the 
proof  and  to  make  many  valuable  suggestions  for  which  the  writer  is  very 
grateful. 

The  writer  would  be  glad  to  have  his  attention  called  to  errors  in 
the  text. 

Richard  C.  Tolman. 

California  Institute  of  Technology, 
August  30,  1926. 
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STATISTICAL  MECHANICS  WITH 

APPLICATIONS  TO  PHYSICS 

AND  CHEMISTRY 

Chapter  i 
The  Methods  of  Statistical  Mechanics 

1.  Introduction. — In  the  case  of  very  complicated  systems  such  as 
those  studied  by  the  chemist  and  the  biologist,  it  is  very  difficult  if  not 
impossible  either  to  observe  experimentally  or  predict  theoretically  the 
precise  behavior  of  all  the  elements  of  any  given  system.  Even  if  the 
chemist  were  told  at  a  certain  instant  the  exact  locations,  orientations,  and 
velocities  of  all  the  atoms  and  molecules  of  which  a  reacting  mixture  is 
composed,  and  given  the  precise  laws  governing  their  individual  behavior, 
the  complexities  of  calculation  would  prevent  him  from  predicting,  even 
a  fraction  of  a  second  later,  the  positions  and  chemical  combinations  of 
these  elements.  And  for  the  biologist  with  his  even  more  complicated 
systems,  which  consist  not  of  a  single  reacting  mixture  but  of  an  integrated 
combination  of  reaction  zones,  with  an  interplay  between  them  of  the 
physical-chemical  processes  taking  place  therein,  the  problem  would  be 
even  more  impossible  of  solution. 

Nevertheless,  in  spite  of  the  impossibility  of  precise  observation  and 
prediction,  the  chemist  and  the  biologist  have  not  been  deterred  from  their 
labors.  Even  if  they  can  not  follow  nor  predict  the  exact  behavior  of 
individual  elements,  they  can  observe  and  predict  with  remarkable  surety 
the  gross  behavior  of  their  systems  as  a  whole.  Each  time  that  the  chemist 
touches  a  match  to  a  soap  bubble  full  of  hydrogen  and  oxygen,  he  obtains 
an  explosion ;  each  time  the  bacteriologist  autoclaves  his  glassware,  he  kills 
the  bacteria  therein ;  and  each  time  there  is  a  war,  the  sociologist  can 
predict  that  rich  men  will  profiteer  and  poor  men  die.  These  compli- 
cated phenomena  do  not  lie  beyond  the  realm  of  law  and  predictability. 

To  investigate  the  laws  that  do  describe  the  gross  behavior  of  systems 
containing  many  molecules,  the  science  of  statistical  mechanics  has  been 
specially  devised.  The  essence  of  its  method  consists  in  abandoning  the 
attempt  to  predict  the  exact  behavior  of  each  molecule  of  a  given  system, 
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and  determining  by  statistical  methods  what  on  the  average  will  be  the 
behavior  of  a  great  number  of  systems  of  similar  constitution  to  the  one 
of  interest.  The  appeal  thus  made  to  the  methods  of  statistics  in  the  realm 
of  physical-chemistry  is  hence  quite  similar  to  the  use  of  statistics  in  the 
biological  sciences. 

It  is  the  purpose  of  the  following  book  to  develop  in  a  logical  manner 
the  principles  of  statistical  mechanics  and  apply  them  to  the  elucidation  of 
a  number  of  physical  and  chemical  phenomena.  Special  stress  will  be 
laid  on  applications  in  the  field  of  chemistry,  since  the  chemist  always 
deals  with  such  a  complicated  collection  of  atoms  that  other  methods  are 
hard  to  apply.  More  and  more  the  chemist  must  turn  to  the  methods 
of  statistical  mechanics  in  order  to  continue  the  theoretical  development 
of  his  science. 

2.  Comparison  of  the  Methods  of  Ordinary  and  Statistical 
Mechanics. — A  better  idea  of  the  methods  of  statistical  mechanics  will 
be  obtained  if  we  compare  them  first  with  those  of  ordinary  mechanics 
and  then  with  those  of  thermodynamics. 

To  take  a  concrete  illustration,  let  us  consider  a  gas  composed  of  a 
large  number  of  similar  molecules  moving  about  in  all  directions  inside 
the  walls  of  a  container.  And  for  simplicity  let  us  take  the  molecules 
as  small  rigid  elastic  spheres. 

To  apply  the  methods  of  ordinary  mechanics  to  such  a  gas  we  should 
have  to  know  at  some  given  instant  the  positions  and  velocities  of  all  the 
molecules.  We  could  then  foresee  the  collisions  that  were  about  to  take 
place  among  the  molecules  themselves  and  between  the  molecules  and 
walls  of  the  container,  and  by  applying  the  principles  of  the  conservation 
of  energy  and  momentum  to  these  collisions  could  predict  their  result. 
We  should  thus  theoretically  be  able  to  follow  the  further  behavior  of 
the  gas  and  predict  its  condition  at  any  future  time.  Practically,  how- 
ever, we  should  be  lost  in  our  calculations  before  the  lapse  of  a  thousandth 
of  a  second. 

The  difficulty  of  thus  using  the  methods  indicated  by  ordinary 
mechanics  led  in  the  older  kinetic  theory  of  gases  to  the  introduction 
of  various  artificial  simplifications,  which  are  not  now  necessary,  such  as 
the  assumption  that  all  the  molecules  had  the  same  velocity  distributed 
in  random  directions,  thus  neglecting  the  effect  of  collisions  between  the 
molecules  themselves  in  changing  the  velocity.  With  the  help  of  such 
simplifications  the  methods  of  ordinary  mechanics  can  lead  to  a  number 
of  valuable  approximate  results. 

To  apply  the  methods  of  statistical  mechanics  to  determine  the  behavior 
of  a  gas,  such  as  that  postulated  above,  we  do  not  try  to  investigate  the 
behavior  of  a  single  sample  of  the  gas.     Instead  we  consider  an  infinite 
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number  or  ensemble  of  samples  of  the  gas,  started  off  with  their  molecules 
having  all  possible  combinations  of  positions  and  velocities.  Even  though 
we  can  not  predict  the  precise  behavior  of  any  single  sample  of  the  gas, 
nevertheless,  surprising  as  it  may  seem,  we  are  able  by  the  application 
of  the  principles  of  mechanics  to  obtain  information  as  to  the  statistical 
behavior  of  the  whole  ensemble,  and  thus  to  draw  conclusions  as  to  the 
probable  behavior  of  a  single  sample.  Thus,  for  example,  we  find  that 
of  all  the  samples  in  the  ensemble  only  an  infinitesimal  fraction  of  them 
will  have  the  velocities  of  their  molecules  distributed  in  a  manner  appre- 
ciably different  from  that  given  by  the  Maxwell  distribution  law,  and 
hence  conclude  that  any  given  sample  of  gas  will  probably  have  the 
Maxwell  distribution  of  velocities  and  if  displaced  therefrom  will  tend 
to  assume  that  distribution  of  velocities  when  left  to  itself. 

At  first  sight  it  seems  surprising  that  the  indicated  method  should  be 
so  powerful.  A  single  sample  of  the  gas  with  its  many  molecules  was  so 
complicated  a  system  that  we  could  not  directly  predict  its  future  behavior, 
and  yet  by  going  to  the  much  more  complicated  system  of  an  infinite 
ensemble  of  similar  samples  we  appear  to  obtain  such  predictions  as  to 
behavior.  The  apparent  paradox,  however,  is  dependent  on  the  circum- 
stance that  in  the  first  case  with  the  single  sample  we  attempted  to  predict 
the  actual  behavior  of  this  single  sample ;  with  our  new  method  on  the 
other  hand  we  content  ourselves  with  a  determination  of  the  statistical 
behavior  of  the  ensemble  and  hence  a  prediction  as  to  the  probable 
behavior  of  a  single  sample. 

A  similar  situation  arises  in  the  biological  sciences.  Thus  a  given  man 
between  the  ages  of  thirty-four  and  thirty-five  is  such  a  complicated 
system  of  atoms  and  molecules  that  it  is  impossible  to  predict  whether 
or  not  he  will  die  within  the  coming  year.  Nevertheless,  in  a  given 
country  we  have  very  reliable  statistics  as  to  the  number  of  men  per 
thousand  between  the  ages  of  thirty-four  and  thirty-five  that  die  per 
annum,  and  hence  we  can  make  a  very  satisfactory  prediction  as  to  the 
probability  or  not  that  the  man  in  question  will  live  throughout  the  year. 
In  this  biological  case  the  prediction  is  based  on  statistical  laws  obtained  as 
a  result  of  observation,  while  in  the  physical-chemical  cases  which  will  be  of 
interest  to  us  the  statistical  laws  will  be  obtained  as  the  result  of  deduction 
from  the  laws  of  mechanics.  Nevertheless  the  reason  for  the  appeal 
to  statistical  methods  is  the  same  in  both  cases,  the  complexity  of  the 
systems  making  the  use  of  more  direct  methods  difficult. 

The  impression  of  course  must  hot  be  gained  that  statistical  mechanics 
can  be  profitably  applied  only  to  systems  which  are  too  complicated  to 
permit  the  application  of  ordinary  mechanical  methods.  Also  in  the  case 
of  a  system  of  only  a  few  degrees  of  freedom  governed  by  simple  equa- 
tions of  motion,  we  can  consider  an  ensemble  of  such  systems  and  discuss 
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the  statistical  behavior  of  this  ensemble.^  An  application  of  this  kind 
can  be  of  importance  in  case  the  initial  values  of  the  coordinates  and 
velocities  of  the  system  at  some  given  instant  are  unknown  or  not  of 
interest.  Nevertheless  in  the  present  book  we  shall  develop  the  applica- 
tion of  statistical  mechanics  to  physical-chemical  systems  with  many 
degrees  of  freedom  corresponding  to  their  many  molecules,  and  our  chief 
interest  will  be  the  practical  one  of  elucidating  their  behavior  by  a  method 
that  has  elements  of  superiority  over  any  other. 

3.  Comparison  of  the  Methods  of  Statistical  Mechanics  and 
Thermodynamics. — For  the  theoretical  treatment  of  the  behavior  of 
complicated  physical-chemical  systems  the  method  of  thermodynamics  has 
been  used  in  the  past  even  more  than  that  of  statistical  mechanics.  A 
comparison  of  the  nature  of  the  two  methods  and  some  estimate  of  their 
relative  advantages  will  be  of  interest. 

If  we  first  compare  the  two  sciences  as  to  the  certainty  of  their  foun- 
dation and  the  rigor  of  their  development,  the  advantage  most  certainly 
lies  with  thermodynamics. 

The  whole  of  classical  thermodynamics  is  based  entirely  upon  two 
postulates,  the  first  and  second  laws  of  energy,  and  must  be  regarded 
merely  as  the  logical  system  of  theorems  which  can  be  drawn  from  these 
postulates  by  applying  them  to  different  types  of  physical-chemical  sys- 
tems. Moreover,  the  truth  of  the  two  fundamental  laws,  at  least  as  far 
as  the  macroscopic  behavior  of  physical-chemical  systems  is  concerned, 
can  hardly  be  questioned ;  deviations  from  the  laws  have  never  been  found, 
and  the  whole  past  history  of  successful  application  furnishes  a  vindication 
of  the  starting  point.  We  must  regard  the  theorems  of  thermodynamics 
as  among  the  most  certain  possessions  of  science. 

On  the  other  hand,  when  we  turn  to  statistical  mechanics  we  find  a 
much  less  certain  foundation,  and  for  parts  of  the  science  a  less  rigorous 
method  of  development. 

The  first  underlying  assumption,  in  the  application  of  the  science,  is 
the  hypothesis  of  the  atomic  and  molecular  structure  of  matter.  This 
general  hypothesis  no  longer  has  to  withstand  the  attacks  of  the  older 
school  of  physical  chemists,  influenced  by  Ostwald's  point  of  view  that 
the  scientist  should  give  attention  only  to  observable  phenomena.  Never- 
theless, just  at  the  present  moment,  the  physicists  themselves,  greatly 
influenced  by  the  development  of  the  general  theory  of  relativity,  which 
concerns  itself  only  with  observable  space-time  coincidences,  are  tempted 
to  follow  Heisenberg's  lead  in  hoping  that  the  difficult  problems  of  the 
quantum  theory  can  be  solved  by  neglecting  the  atomic  constitution  of 
matter  and  concentrating  attention  solely  on  the  observed  relations  as  to 

'See  E.  B.  Wilson,  Annals  of  Math.,  10,  129,  149  (1909). 


2-3]  GENERAL  METHODS  21 

the  frequencies  and  intensities  of  spectral  lines.  In  any  case,  even  though 
we  feel  justified  in  accepting  as  one  of  our  starting  points  the  general 
theory  of  atomic  structure,  we  shall  find  in  the  course  of  our  development 
of  statistical  mechanics  many  times  a  necessity  for  specific  hypotheses  as 
to  the  precise  nature  of  atoms  and  molecules,  which  at  the  best  can  be 
regarded  only  as  necessary  simplifying  assumptions. 

Having  started  with  the  hypothesis  of  the  atomic  structure  of  matter, 
the  method  of  statistical  mechanics  then  consists  in  applying  the  laws  of 
dynamics  to  determine  the  behavior  of  an  ensemble  of  systems,  each 
system  itself  composed  of  atoms  and  molecules.  In  the  time  of  Gibbs 
and  Boltzmann  there  would  have  been  few  to  question  this  procedure. 
The  laws  of  dynamics,  as  codified  for  example  in  the  form  of  Hamilton's 
principle,  are  certainly  a  valid  abstraction  drawn  from  actual  experiments 
on  the  behavior  of  moving  bodies  of  macroscopic  size,  and  the  danger 
of  applying  these  same  laws  to  the  motions  of  molecules  was  then  but 
dimly  appreciated.  Today,  however,  the  development  of  quantum  theory 
has  indicated  that  the  most  satisfactory  treatment  of  atomic  and  molecular 
behavior  may  involve  a  complete  revision  of  the  laws  of  dynamics  and 
perhaps  even  of  our  notions  as  to  the  continuity  of  space  and  time. 

In  the  present  book  we  shall  try  to  solve  this  difficulty  by  first  develop- 
ing what  may  be  called  the  classical  statistical  mechanics  based  on  the 
older  laws  of  dynamics.  We  shall  then  regard  this  as  a  limiting  case, 
valid  when  high  frequencies  of  periodic  motion  are  not  involved,  and 
shall  proceed  by  introducing  somewhat  arbitrarily  such  modifications  as 
seem  most  suitable  for  the  treatment  of  the  great  group  of  quantum 
phenomena. 

One  further  lack  of  rigor  in  the  method  of  statistical  mechanics  must 
be  mentioned.  Very  early  in  the  development,  even  of  its  classical  appli- 
cations to  physical-chemical  systems,  we  are  forced  to  introduce  the 
famous  ergodic  hypothesis  or  principle  of  continuity  of  path.  The  intro- 
duction of  this  hypothesis  is  certainly  justified  by  its  results,  and  the 
reasons  for  its  introduction  in  a  suitably  modified  form  can,  as  we  shall 
see,  be  made  plausible.  Nevertheless,  this  hypothesis  still  presents  a 
bafflng  problem  for  further  study. 

This  concludes  our  statement  of  the  sources  of  lack  of  rigor  in  the 
method  of  statistical  mechanics.  They  are :— first,  the  assumption  of  the 
atomic  hypothesis  itself,  not  so  much  the  general  assumption  of  the  atomic 
structure  of  matter  as  specific  assumptions  as  to  atomic  properties  which 
must  be  made  from  time  to  time  in  the  course  of  the  development ;  second, 
the  application  of  classical  dynamics  to  describe  the  behavior  of  atomic 
systems,  arbitrarily  modified  to  take  account  of  quantum  phenomena 
rather  than  revivified  by  the  newer  dynamics  which  is  yet  to  come,  even 
though   already   heralded   by   birth   pangs;   and   third,    the   introduction 
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of  the  ergodic  hypothesis  which  has  to  be  made  in  order  to  obtain  the 
important  appHcations  to  physical-chemical  systems  that  we  have  in  mind. 
It  should  be  remarked  that  this  lack  of  rigor  is  not  resident  in  the 
science  of  statistical  mechanics  per  se,  but  only  arises  when  we  come  to 
actual  applications  to  the  problems  of  physics  and  chemistry.  Never- 
theless, we  are  interested  in  comparing  the  methods  of  thermodynamics 
and  statistical  mechanics  as  applied  to  actual  physical-chemical  systems, 
and  for  this  purpose  there  is  no  question  that  thermodynamics  at  the 
present  time  is  superior  as  to  rigor. 

4.  If  we  turn  our  attention  now,  however,  to  the  results  that  can  be 
achieved  by  the  two  methods,  the  balance  is  all  in  the  favor  of  the  newer 
science. 

In  the  first  place,  as  will  be  shown  in  a  later  chapter  of  the  present 
book,  the  two  laws  of  thermodynamics  may  themselves  be  regarded  as 
derivable  from  the  principles  of  statistical  mechanics  and  interpretable  as 
the  necessary  result  of  atomistics.  Hence  all  the  possible  conclusions  of 
thermodynamics  can  in  this  way  at  least  be  indirectly  obtained  from 
statistical  mechanics,  although  there  are  often  advantages  in  their  direct 
derivation  since  the  theorems  then  obtain  a  more  intimate  atomistic  inter- 
pretation. 

In  the  second  place,  the  properties  of  matter  enter  the  equations  of 
thermodynamics  merely  as  empirical  parameters  whose  values  have  to  be 
determined  by  actual  measurements  applied  to  macroscopic  systems.  In 
the  treatments  of  statistical  mechanics,  however,  the  actual  values  of  these 
quantities  are  often  deducible  from  their  atomistic  interpretation.  Thus, 
for  example,  in  thermodynamic  deductions  specific  heats  of  gases  and 
solids,  heats  of  reaction,  and  equilibrium  constants  enter  the  equations  and 
are  related  one  with  another.  Their  absolute  values  remain,  however, 
unknown.  In  statistical  mechanical  treatments,  however,  values  of  specific 
heats  can  be  obtained  from  a  consideration  of  the  number  of  degrees  of 
freedom  of  the  system  in  question  and  the  extent  of  their  excitation ; 
values  of  heats  of  reaction  are  gradually  being  related  to  the  energy  levels 
of  atoms  and  molecules  as  deteiTnined  from  spectroscopy ;  and  absolute 
values  of  equilibrium  constants  are  obtained  by  considering  the  relative 
probability  of  different  arrangements  of  the  atoms  to  form  molecules.^ 

In  the  third  place,  even  in  the  field  of  equilibrium  relations,  for 
the  treatment  of  which  thermodynamics  is  specially  fitted,  laws  arise  which 
thermodynamics  must  use  but  can  not  derive.  Thus  in  particular  the 
gas  laws  and  the  equations  of  state  for  matter  in  any  form  can  for  thermo- 

*  The  third  law  of  thermodynamics  essays  this  same  task  of  determining  absolute 
values  of  equilibrium  constants.  The  correct  formulation  of  the  third  law, 
however,  involves  statistical  mechanical  interpretation. 
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dynamics  be  only  empirical  facts,  but  for  statistical  mechanics  they  are 
already  either  deduced  or  in  any  case  deducible  by  methods  that  we  can 
now  foresee. 

In  the  fourth  place,  for  a  system  which  is  not  in  equilibrium,  thermo- 
dynamics can  only  tell  us  that  those  changes  are  permitted  which  involve 
an  increase  in  entropy.  It  can  give  us  no  information  as  to  the  velocities 
with  which  the  permitted  changes  will  take  place  and  hence  in  actuality 
usually  gives  us  no  prediction  as  to  what  will  really  happen  in  a  system 
which  is  not  in  equilibrium.  Thus  such  questions  as  rates  of  diffusion 
and  evaporation,  rates  of  thermal  and  electrical  conduction,  rate  of  momen- 
tum transfer,  and  rates  of  thermal  and  photochemical  reactions  all  lie 
beyond  the  power  of  thermodynamic  reasoning.  They  are,  however,  as 
we  shall  see,  in  the  present  book,  amenable  to  statistical  mechanical  treat- 
ment. 

Finally,  when  phenomena  arise  which  are  essentially  atomistic  in 
character,  thermodynamics  can  of  course  have  absolutely  no  help  to  give. 
The  Brownian  motion  of  particles,  the  fluctuations  in  the  density  of  a 
fluid,  the  differences  in  the  velocities  of  electrons  from  a  hot  filament, 
and  the  distribution  of  thermal  radiation  among  its  different  frequencies 
all  furnish  problems  which  statistical  mechanics  may  treat,  but  which  a 
non-atomistic  thermodynamics  must  actually  fail  to  recognize. 

This  long  recital  does  not  exhaust  the  achievements  of  statistical 
mechanics.  It  will  give  some  idea,  however,  of  the  power  of  the  method 
and  the  scope  of  its  attack.  The  future  of  theoretical  chemistry  is 
dependent  on  its  applications  and  there  will  be  a  mutual  and  advantageous 
interplay  in  the  development  of  these  two  sciences. 


Chapter  2 
The  Elements  of  Classical  Mechanics 

5.  Introduction. — In  the  following  brief  chapter  we  shall  present 
some  of  the  principles  of  classical  dynamics  which  will  be  necessary  for 
an  understanding  of  the  foundations  of  statistical  mechanics. 

Any  general  treatment  of  dynamics  is  of  course  beyond  the  scope 
of  a  single  short  chapter.  We  shall  try,  however,  in  the  first  place  to 
present  the  major  part  of  what  is  necessary  for  future  reference,  and  in 
the  second  place  to  give  a  correct  feeling  as  to  the  fundamental  nature 
of  the  methods  employed  in  dynamical  investigations. 

6.  Generalized  Coordinates  and  Velocities. — The  essence  of  any 
dynamical  problem  always  consists  in  the  calculation  of  the  future  condi- 
tion of  some  system  of  interest  from  a  knowledge  of  its  initial  condition 
at  some  chosen  time.  The  condition  of  the  system  at  any  instant  can 
always  be  given  by  a  statement  of  the  coordinates  which  determine  the 
positions  of  its  parts  and  the  velocities  which  determine  the  rate  of  change 
of  these  coordinates  with  the  time. 

In  the  case  of  a  system  of  point  particles,  or  bodies  which  can  be 
treated  as  such,  it  will  often  be  natural  to  specify  the  positions  of  the  parts 
of  the  system  by  giving  the  values  of  the  Cartesian  coordinates  x,  y  and  2 
for  each  particle,  and  their  velocities  by  giving  the  values  of  the  com- 
ponent velocities  x,  y  and  i  for  each  particle. 

In  many  cases,  however,  some  choice  of  other  than  Cartesian  coordi- 
nates for  the  specification  of  positions  may  be  more  convenient  or  natural. 
Thus  for  the  position  of  a  swinging  pendulum  bob,  it  is  more  satisfactory 
to  give  the  single  angle  1^  which  the  pendulum  makes  with  the  vertical, 
than  to  give  the  values  of  the  two  Cartesian  coordinates  x  and  y  which 
could  also  be  used  to  determine  its  position.  For  an  electron  rotating 
in  a  circular  hydrogen  orbit,  the  angle  4>  niade  by  the  radius  vector  from 
nucleus  to  electron  is  the  natural  coordinate.  And  for  an  electron,  mov- 
ing inside  an  atom  subjected  to  an  external  electric  field  as  in  the  Stark 
effect,  the  parabolic  coordinates  furnished  by  a  system  of  parabolse  with 
the  nucleus  at  their  focus  and  their  axes  parallel  to  the  field  are  found  to 
be  convenient. 

24 
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Such  variables  are  called  generalized  coordinates  and  may  be  chosen 
in  a  great  variety  of  ways  depending  on  their  convenience  for  the  problem 
at  hand.  Corresponding  to  each  of  these  coordinates  will  be  a  gencraliacd 
velocity,  giving  the  rate  of  change  with  the  time  of  the  particular  coordi- 
nate in  question.  For  the  generalized  coordinates  of  a  system  we  shall 
use  the  letters  q^  q-^  .  .  .  .  qn,  and  for  the  corresponding  generalized  veloc- 
cities  the  symbols  ^1^2  .  .  •  •  qn-  It  has  been  one  of  the  most  useful 
achievements  of  dynamics  to  obtain  its  laws  in  forms  which  are  inde- 
pendent of  the  particular  nature  of  the  coordinates  that  are  employed. 

7.  The  number  of  coordinates  chosen  must  of  course  be  at  least 
sufficient  for  the  complete  specification  of  the  configuration  of  the  system, 
and  it  is  most  convenient  when  it  does  not  exceed  that  number.  If  the 
coordinates  so  chosen  are  all  of  them  capable  of  independent  displacement 
without  violating  the  constraints  of  the  structure  the  system  is  said  to  be 
holonomic  and  the  number  of  coordinates  is  called  the  tiumber  of  degrees 
of  freedom  of  the  system.  If  the  minimum  number  has  been  chosen  and 
they  are  not  all  capable  of  independent  variation,  but  their  displacement 
connected  by  non-integrable  relations/  the  system  is  said  to  be  non- 
holonomic.  The  number  of  degrees  of  freedom  is  then  equal  to  the 
number  of  coordinates  capable  of  independent  displacement. 

8.  Hamilton's  Principle.— The  search  for  a  single  general  dynamical 
principle  which  would  apply  to  the  motions  of  any  system  has  had  a  con- 
siderable history,  which  starts  with  the  theological  arguments  of  Mauper- 
tuis  in  favor  of  the  principle  of  least  action  in  1744,  includes  the  general 
formulation  of  this  principle  by  Lagrange  in  1760-61,  and  the  introduction 
of  the  principle  of  least  curvature  by  Gauss  in  1829,^  and  having  found  a 
certain  temporary  culmination  in  the  discovery  of  Hamilton's  principle  in 
1834,  is  perhaps  still  continuing  at  the  present  time  in  the  investigation 
of  different  variation  principles  suitable  for  use  in  the  general  theory  of 
relativity. 

For  the  needs  of  the  present  book  we  shall  take  Hamilton's  principle 
as  the  starting  point  for  our  discussions.  For  ordinary  mechanical  sys- 
tems all  the  different  principles  lead  to  the  same  conclusions,  but  Hamil- 
ton's principle  seems  to  be  the  most  satisfactory  formulation  for  our 
purposes.  The  principle  is  to  be  regarded  as  a  postulate,  justified  by  the 
correspondence  between  experimental  facts  and  the  conclusions  that  can 
be  deduced  from  it. 

^If  the  relations  were  integrable  a  reduction  in  the  number  of  coordinates 
chosen  would  be  possible. 

^  Later  developed  by  Hertz. 
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9.  Hamilton's  principle  can  be  stated  in  the  form 

dCldt^O  (1) 

The  equation  applies  to  the  motion  of  any  conservative  system  ^  from  the 
configuration  it  has  at  time  ^i  to  the  configuration  it  assumes  at  time  fz- 
The  quantity  L  is  the  so-called  Lagrangian  function  of  the  generalized 
coordinates  and  velocities 

L  =  L  (qiq2  ..  .  qnqiq2.  • .  qn)  (2) 

and  the  sign  of  variation  indicates  that  the  integral  (  Ldt  has  a  minimum  * 

value  for  the  actual  path  by  which  the  system  travels  from  its  initial  to 
its  final  configuration,  as  compared  with  any  neighboring  slightly  dis- 
placed path  by  which  it  could  be  made  to  go  in  the  same  time  interval 
from  the  initial  to  the  final  point.  The  requirement  that  the  time  interval 
should  not  be  changed  means  that  the  limits  ^  and  /g  are  to  be  kept  con- 
stant in  performing  the  indicated  variation. 

The  principle  is  a  wonderfully  compact  formulation  to  serve  as  the 
single  axiom  for  an  immense  body  of  science.  It  has,  however,  the 
disadvantage  that  it  is  difficult  to  get  an  intuitional  grasp  of  its  signifi- 
cance and  inevitability  as  seems  to  be  possible,  for  example,  in  the  case 
of  the  laws  of  the  conservation  of  energy  and  momentum,  and  the  law 
of  the  increase  of  entropy. 

10.  Applications  of  the  principle  can  of  course  not  be  made  until  we 
have  information  as  to  the  way  the  Lagrangian  function  L  depends  on 
the  generalized  coordinates  and  velocities  (gi  .  .  .  .  g„)  as  indicated  by 
equation  (2).  For  ordinary  mechanical  systems  involving  low  velocities, 
the  Lagrangian  function  is  the  difference  between  the  kinetic  and  potential 
energies  of  the  system 

L  =  T—V  (3) 

For  a  system  of  particles  involving  high  velocities  of  the  order  of  the 
speed  of  light,  such  as  are  considered  in  the  special  theory  of  relativity, 
the  Lagrangian  function  has  the  value 

L  =  Hmoc'il—  VI  —  u'/c')  —  V  (4) 

where  >%  is  the  mass  of  a  particle  at  rest,  c  is  the  velocity  of  light,  ii  the 

'  Systems  which  are  not  conservative  owing  to  the  action  of  external  forces 
can  always  be  made  so  by  including  in  the  system  "reservoirs"  of  potential  energy 
which  will  exert  these  forces. 

*  For  extended  motions  J' Ldt  is  not  necessarily  a  minimum. 
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velocity  of  the  particle  and  the  summation  is  to  be  taken  over  all  the 
particles  of  the  system. 

For  other  systems  including  non-mechanical  ones,  the  Lagrangian 
function  may  assume  a  variety  of  forms.  It  should  be  regarded  as  a 
matter  of  experimental  science  to  determine  the  actual  form  of  the  func- 
tion L  for  any  particular  application.  This  can  indeed  be  considered  one 
of  the  major  problems  of  experimental  physics  which  has,  however,  been 
successfully  solved  for  a  wide  variety  of  phenomena. 

11.  It  may  seem  somewhat  surprising  that  the  Lagrangian  function 
should  contain  only  the  generalized  coordinates  and  their  first  derivatives 
with  respect  to  the  time,  since  this  would  indicate  that  the  future  motion 
of  a  system  is  never  affected  by  the  initial  values  of  the  accelerations  or 
higher  derivatives.  In  the  case  of  a  system  of  moving  electrons,  for 
example,  this  would  seem  to  lead  to  difficulties,  since  we  know  that  the 
classical  theory  makes  the  emission  of  energy  dependent  on  the  accelera- 
tions of  the  electrons  and  hence  that  we  should  not  be  able  to  predict 
their  further  motion  solely  from  a  knowledge  of  their  initial  positions 
and  velocities.  The  problem  of  a  system  of  moving  electrons  is,  never- 
theless, made  amenable  to  the  principles  of  dynamics  by  the  device  of 
including  in  the  system  to  be  considered  the  energy  of  the  electromagnetic 
fields  surrounding  the  electrons,  and  hence  including  in  the  Lagrangian 
function  L  what  amounts  to  a  coordinate  and  a  velocity  for  all  three  direc- 
tions at  every  point  in  space.  Since  this  makes  the  Lagrangian  function 
contain,  as  well  as  the  coordinates  and  velocities  for  the  electric  charges, 
an  infinite  number  of  further  coordinates  and  velocities,  it  is  not  surpris- 
ing that  the  treatment  then  becomes  possible. 

The  possibility  of  throwing  an  exceedingly  wide  variety  of  problems 
into  the  form  demanded  by  dynamics  may  in  part  be  due  to  the  great 
flexibility  that  is  open  to  us  in  the  choice  of  the  form  of  the  Lagrangian 
function  L,  as  illustrated  by  the  example  just  cited.  The  advantages, 
however,  in  treating  as  many  phenomena  as  possible  by  equations  having 
I  the  same  mathematical  form  will  of  course  be  apparent. 

12.  Derivation  of  the  Equations  of  Motion  in  the  Lagrangian 
Form. — From  Hamilton's  principle  we  may  now  proceed  to  obtain  the 
equations  for  the  motion  of  a  system,  first  in  the  Lagrangian  form  and 
then  in  the  canonical  form  due  to  Hamilton  himself.  These  forms  of  the 
equations  can  be  made  the  starting  point  for  all  of  dynamics  that  we  shall 
need  to  use. 

To  obtain  the  Lagrangian  equations  from  Hamilton's  principle,  we 
have  merely  to  perform  the  indicated  variation  given  in  equation  (1). 
We  obtain 
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where  dqi  and  dqi  are  the  variations  in  the  vaUies  of  the  fth  coordinate 
and  velocity  in  changing  from  the  actual  to  a  neighboring  displaced  path. 
The  variation  is  carried  out  solely  behind  the  integral  sign  since  Hamilton's 
principle  requires  that  the  time  of  passage  shall  not  be  changed. 

The  second  term  behind  the  integration  sign  can  easily  be  transformed 
by  substituting 

^qi^qt  +  ^qi  —  Qi  =-^(^i  -\-dqi)  —  -^  =-^^  (dgt)  (6) 

giving  us 

which  on  integration  by  parts  gives 

The  first  of  the  two  terms,  however,  is  zero,  since  by  hypothesis  the 
configurations  are  the  same  in  the  actual  and  varied  motions  at  the  initial 
and  final  times  ti  and  to,  so  that  we  finally  obtain 

This  expression  can  now  be  profitably  substituted  in  equation  (5)  to 
give  an  expression  which  no  longer  contains  the  variations  of  the  velocities ; 
we  obtain 

tx 

For  holonomic  systems,  however,  the  different  variations  dqi  are  en- 
tirely arbitrary,  and  furthermore  the  Hmits  of  integration  ti  and  tz  are  also 
arbitrary,  hence  the  truth  of  equation  (8)  can  be  preserved  in  general 
only  if  we  have  for  each  of  the  n  expressions 

L^—^=0  (9) 

dt  dqi       dqi 
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These  are  the  equations  of  motion  in  the  Lagrangian  form  for  a  holonomic, 
conservative  system.^ 

13.  In  applying  these  equations  to  a  system  in  which  L  is  the  differ- 
ence between  the  kinetic  energy  T  or  some  other  function  solely  of  the 
velocities,  and  the  potential  energy  V  given  solely  as  a  function  of  the 
coordinates,  we  can  write 


dL  _       dV 


(10) 


and  regard  this  change  of  potential  energy  with  the  coordinate  as  a  gen- 
eralised force.  We  are  then  at  liberty  if  we  desire  to  take  any  part  of 
this  generalized  force  Qi,  which  it  may  be  convenient  to  think  of  as  due 
to  external  parts  of  the  system,  and  transpose  to  the  other  side  of  the 
equation,  giving  us 

i.Jk_^  =  Q,  (11) 

dt  dqi         dqi 
where  the  quantities  Qi  are  the  external  generalized  forces. 

14.  Definition  of  the  Generalized  Momenta  and  the  Hamiltonian 
Function. — We  may  next  define  the  generalized  momenta  pi  ....  pn 
corresponding  to  the  n  generalized  coordinates  and  velocities  by  the 
equations 

Pi  =  ^  0=1.2.3 n)  (12) 

oqi 

For  the  case  of  a  particle  whose  position  is  given  by  Cartesian  co- 
ordinates, these  generalized  momenta  are  readily  seen  to  become  identical 
with  the  ordinary  components*  of  momentum  inx,  my,  mi  for  slow 
velocities    (see  equation  3),  and  equal  to  the  relativity  expressions   for 

momentum  °'    =  etc.,  for  high  velocities  (see  equation  4). 

V 1  —  u^/c"^ 
We  may  further  define  the  Hamiltonian  function  H  by  the  equation 

H  =  piqi  +  ....^pnqn  —  L  (13) 

'  For  a  non-holonomic  system  the  variations  bqi  would  not  all  be  arbitrarily 
independent,  and  the  treatment  would  have  to  be  changed.  It  is  possible,  however, 
to  reduce  non-holonomic  systems  to  holonomic  ones  by  introducing  forces  which 
do  no  work,  but  preserve  the  postulated  lack  of  freedom  in  the  arbitrary  dis- 
placement of  the  coordinates.  As  remarked  above,  systems  which  are  not  con- 
servative because  of  the  action  of  external  forces  can  be  made  conservative  by 
including  in  the  system  "reservoirs"  of  potential  energy  which  will  exert  these 
forces.  Hence  Lagrange's  equation  can  be  regarded  as  having  a  very  wide  applica- 
tion. 
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15.  Derivation  of  the  Equations  of  Motion  in  the  Canonical 
Form. — Making  use  of  these  definitions,  we  are  now  ready  to  derive  the 
equations  of  motion  in  the  Hamiltonian  or  so-called  canonical  form. 

Differentiating  equation  (13)  we  obtain 

dH  =       pidqi      -f +  pndqn 

+  qidpi      -\-  ....  +  qndpn 

^^   ^  ^^    ^ 

—  'Z —  "^1  —    ....  —  -Q —  aqn 
oqi    ^  aqn 

dL  ^.  dL    ^. 

—  -^7-  dqi  —  ....  —  -^7-  aqn 
oqi    ^  aqn 

and  referring  to  equations  (12)  we  see  that  the  first  and  last  lines  cancel 
giving  us 

dH  =       qidpi    + +  qndpn 

^L  .  dL    .  , ,  . . 

-w.'''"~ ~W.^^"  ^^^^ 

This  dififerential  form  shows  that  H  can  be  obtained  solely  as  a  function 
of  the  g's  and  p's,  the  coefficients  of  the  differentials  in  (14)  being  the 
partial  derivatives  oi  H  with  respect  to  these  quantities,  in  accordance 
with  the  expressions 

dH       .  ^        dH  dL  ,,-- 

■T— -  =  gi         and        -,— =  —  -^—  (15) 

dpi       ^  dqi  dqi 

.     dL 

The  quantity-^ — ,  however,  in  accordance  with  Lagrange's  equation   (9) 
dqi 

and  the  definitions  (12)  is  given  by 

dqi        dt  dqi        dt 
So  that  equations  (15)  can  now  be  written  in  the  symmetrical  form 
dH  ^  dH       . 

Wi=-^'    '       Wr''  0=1.2.3.. ..,0        ^  (16) 

where  H  is  to  be  taken  as  a  function  of  the  g's  and  p's. 

These  are  the  equations  of  motion  in  the  canonical  form.  A  part  of 
their  importance  for  dynamical  investigations  arises  from  the  circum- 
stances that  we  have  in  this  manner  replaced  the  n  second-order  dififer- 
ential  equations  of  Lagrange  by  2  n  first-order  equations,  which  may 
facilitate  the  integration.  These  equations  will  form  the  starting  point 
for  our  investigation  of  the  properties  of  statistical  ensembles. 
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16.  Interpretation  of  the  Hamiltonian  Function  as  the  Energy 
o£  the  System. — The  quantity  H  which  was  originally  defined  by  equa- 
tion (13)  can  now  be  shown  to  have  the  important  property  of  being 
a  constant  independent  of  time.  To  demonstrate  this  we  may  evidently 
write 

dH       dH  .     ,  ^    dH  .      ,    dH  ,     ,  ,    dH  ^ 

■rfT"  ^  %  ^^ ^ +  ••••+  6^  ^ "  + ^  ^ ^ +  ••••  + ^  ^ '' 

and  by  substitution  of  the  canonical  equations  (16)  obtain 
dH 

-^  =—  ^1^1  —  •  •  •  —  Pniln  +  Qlpl  + +  qnpn 

=0  (17) 

We  have  thus  obtained  one  integral  of  the  equations  of  motion, 
namely 

H  =  const.  (18) 

This  quantity  whose  value  is  conserved  throughout  the  motion  of 
the  system  may  now  be  defined  as  the  total  energy  of  the  system  (ex- 
pressed of  course  as  a  function  of  the  q's  and  />'s),  and  from  our  pres- 
ent point  of  view  this  is  the  most  natural  manner  for  the  concept  of  energy 
first  to  enter  into  our  considerations. 

For  a  system  of  particles  moving  with  low  velocities  where  L  in 
accordance  with  equation  (3)  is  the  difference  between  the  kinetic  and 
potential  energies  it  is  easy  to  show  that  H  is  equal  to  the  ordinary 
expression  for  energy,  since  we  can  write 

L  =  T—V  =  ^imiqi^—V 
i 

^*  =  ^  =  ^*^* 

and  substituting  in  equation  (13)  obtain 

H  =  ^Miq,'  —  2  ^  nnqi'  -{- V  =  T  +  V  =  E  (19) 

i  < 

It  is  to  be  noted,  however,  that  the  Hamiltonian  H  is  for  all  systems  the 
energy,  expressed  as  a  function  of  the  coordinates  and  momenta,  even 
when  the  Lagrangian  function  does  not  assume  the  form  T  —  F  of  ordi- 
nary slow  velocity  mechanics. 

This  brief  treatment  of  dynamics  is  all  that  will  be  necessary  for 
our  further  development  of  statistical  mechanics. 


Chapter  3 
The  Properties  of  Statistical  Ensembles 

17.  Introduction.  Ensemble  and  Phase. — We  are  now  ready  to 
proceed  with  the  development  of  statistical  mechanics.  As  already  stated, 
we  shall  not  discuss  the  behavior  of  a  single  system,  but  rather  the 
behavior  of  a  collection  or  ensemble  of  systems,  containing  an  enormous 
number  of  sample  systems  identical  in  nature  with  the  one  of  interest. 
From  the  behavior  of  this  collection  of  samples  we  shall  then  be  able 
to  draw  conclusions  as  to  the  probable  behavior  of  a  single  system. 

The  different  systems  in  the  ensemble  will  be  of  the  same  structure, 
that  is,  composed  of  the  same  numbers  of  atoms  of  each  of  the  differ- 
ent kinds  enclosed  in  similar  containers,  but  will  differ  in  the  positions 
and  velocities  of  their  component  elements.  In  the  language  introduced 
by  Gibbs  they  will  be  said  to  differ  in  phase. 

If  each  system  in  the  ensemble  has  m  generalized  coordinates 
Qi  .  .  .  .  Qm  and  the  corresponding  m  momenta  Pi  .  .  .  Pm,  then  the 
instantaneous  phase  of  a  system  could  be  specified  by  giving  the  momen- 
tary values  of  these  m  coordinates  and  momenta.  The  reason  for  choosing 
the  generalized  coordinates  and  momenta  rather  than  coordinates  and 
velocities  will  later  become  apparent. 

18.  Phase  Spaces.^ — To  follow  the  behavior  of  such  a  system,  that 
is,  its  change  in  phase  with  time,  it  is  very  convenient  to  think  of  its 
phase  as  given  by  the  position  of  a  representative  point  (phase  point) 
in  a  2  w-dimensional  space  (phase  space)  corresponding  to  the  2  m  co- 
ordinates and  momenta  whose  values  are  to  be  specified.  As  time  pro- 
ceeds, the  phase  point  for  a  given  system  will  describe  a  trajectory  in 
this  2  m-dimensional  phase  space,  and  the  phase  points  for  all  the  sys- 
tems of  an  ensemble  can  be  regarded  as  describing  stream  lines  in  the 
space.  This  device  will  have  the  advantage  of  providing  a  ready  made 
geometrical  language  for  discussing  the  behavior  of  the  ensemble. 

If  our  system  is  composed  of  molecules,  it  will  also  often  be  con- 
venient to  represent  the  values  of  the  coordinates  and  momenta  for  a 
single  molecule,  in  a  phase  space  of  a  smaller  number  of  dimensions 
than  that  necessary  for  the  whole  system.    Thus  the  phase  of  a  single  mole- 

32 


17-19]  STATISTICAL  ENSEMBLES  33 

cule  having  the  2  n  coordinates  and  momenta  qi  ....  Qn  pi  ....  pn 
could  be  represented  by  the  position  of  a  point  in  a  space  of  2  w  dimensions. 

In  order  to  avoid  the  possibiHty  of  confusing  these  two  different  phase 
spaces,  it  will  sometimes  be  convenient  to  employ  the  nomenclature  of 
Ehrenfest  and  call  the  phase-space  used  for  representing  the  condition 
of  the  whole  system  or  "gas"  a  y-space,  and  the  phase  space  for  a  single 
molecule  a  //-space. 

If  our  system  can  be  regarded  as  built  up  from  N  monatoniic  atoms, 
the  /-space  would  have  2  w  =  6  N  dimensions,  and  the  //-space  for  one 
atom  would  have  2  n  =  6  dimensions.  Furthermore,  the  6  A'^-dimensional 
y-space  could  evidently  be  regarded  as  composed  of  the  N  dififerent 
6-dimensional  spaces  belonging  to  the  individual  atoms. 

19.  Distribution  o£  the  Ensemble  in  Phase. — Each  system  in  the 
ensemble  will  give  us  a  point  in  the  y-space.  We  may  initially  dis- 
tribute these  points  in  any  way  we  desire,  and  then  follow  their  motion 
as  they  describe  stream  lines  in  this  space.  We  shall  define  the  density 
of  distribution  q  at  any  position  in  the  phase  space  as  the  number  of 
points  per  unit  2  iH-dimensional  volume  at  the  position  in  question. 

Different  methods  of  initially  distributing  the  points  in  the  phase 
space  can  be  used  for  different  purposes.  The  three  distributions  that 
have  been  most  employed  are  the  canonical  ensemble  introduced  by  Gibbs 
and  used  by  him  and  others  ^ 

■^  —E 

Q  =  Ne~e-  (20) 

where  A''  is  the  total  number  of  systems  in  the  ensemble,  E  is  the  energy 
of  a  single  system,  and  IF  and  0  are  constants  having  the  dimensions  of 
energy ;  the  microcanonical  ensemble  used  by  Gibbs,  Boltzmann,  Jeans 
and  others 

Q  z=z  const.     (When  the  energy  lies  between  E  and  E  -\-  dE)     ,^.. 
q:=  0  (For  other  values  of  the  energy)  ^     / 

and  the  surface  ensemble  used  by  Ehrenfest,  in  which  the  phase  points 
are  distributed  on  a  surface  in  the  /-space  of  constant  energy,  with  the 
surface  density 

const. 

^==  (22) 


my^ Hm'- 


All  three  of  these  distributions  can  be  used  for  drawing  conclusions 
as  to  the  probable  behavior  of  a  single  system. 

The  surface  ensemble  would  at  first  sight  seem  to  be  the  most  suit- 
able for  this  purpose,  since  all  the  systems  of  the  ensemble  could  be 
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given  the  same  energy  as  that  of  the  system  of  interest.  From  the  point 
of  view  of  mathematical  simphcity,  however,  the  surface  distribution 
is  not  as  easy  to  handle  as  the  others,  and  we  shall  not  employ  it  in  the 
present  book. 

From  many  points  of  view,  the  canonical  ensemble  of  Gibbs  is  mathe- 
matically the  simplest  and  theoretically  the  most  illuminating.  Its  use, 
for  predicting  the  behavior  of  a  single  system,  is  dependent  on  the  fact 
that  most  of  the  systems  in  the  ensemble  will  be  found  to  have  an  energy 
sensibly  equal  to  the  average  energy  for  all  the  systems  in  the  ensemble, 
the  fluctuations  in  energy  being  of  the  order  that  might  be  expected  for  a 
system  placed  in  a  thermostat.  Hence  a  system  picked  at  random  from 
the  ensemble  may  be  regarded  as  a  fair  sample  of  a  single  system  left 
to  itself.  In  Chapter  23  we  shall  consider  some  of  the  properties  of  this 
distribution. 

The  microcanonical  ensemble  is  the  one  which  has  been  most  employed, 
and  we  shall  find  it  the  most  satisfactory  for  our  purpose.  By  taking 
the  energy  range  dE,  within  which  the  phase-points  are  distributed  in 
the  y-space,  as  small  as  we  desire,  we  can  give  all  the  systems  sensibly 
the  same  energy,  and  shall  find  that  we  can  then  regard  a  system,  picked 
at  random  from  this  ensemble,  as  a  fair  sample  of  a  single  system  left 
to  itself  with,  an  energy  content  within  this  small  range.  Most  of  our 
later  work  will  be  based  on  the  behavior  of  microcanonical  ensembles. 

20.  Change  in  Density  o£  Distribution  with  the  Time.  Liou- 
ville's  Theorem. — -We  must  now  investigate  for  any  arbitrary  distribu- 
tion of  the  density  ^  as  a  function  of  the  Q's  and  F's,  the  change  of  the 
density  q  with  the  time. 

Let  us  consider  some  particular  cubical  element  dQi  .  .  .  dQ^dPi  .  .  .  dPm 
in  the  2  «2-dimensional  phase  space  and  let 

A^  =  QdQ, dQmdP, dPr,^  (23) 

be  the  number  of  phase  points  within  that  element  of  volume.  In  gen- 
eral the  number  of  points  within  the  element  will  be  changing  with  the 
time,  since  the  phase  points  will  be  moving  in  accordance  with  the  laws 
of  motion  governing  the  behavior  of  the  systems,  and  a  different  num- 
ber of  points  may  enter  any  face  of  the  cube  than  leave  by  the  opposite 
face.  If  we  consider  the  two  parallel  faces  of  the  cube  perpendicular 
to  the  Qi  axis,  each  having  the  area  dQ2  ....  dPmdPi  ....  dPm  the  num- 
ber of  points  entering  the  cube  through  the  first  face  per  second  will 
evidently  be  QQidQo ....  dOmdPi  ....  dPm  and  the  number  leaving  through 
the  opposite  face 


(.  +  |-^<).)(0.  +  ||<'(?.)^Q= 


dQ,n  dP,....  dP, 
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Hence  considering  all  the  pairs  of  parallel  faces,  we  can  write  for  the 
total  rate  of  change  of  the  number  of  points  within  the  cube 

^  =  QQidQ, rfF„,  _  (^  +  ^/Ox)  (Ox  +  f§^  ^Oi)  d.Q,....  dP^ 

+  QP,ndQ,  ....  dPm-^  -  (q  +^  ^^-)(^-  +  1^  ^P-)  dQ,....  dPm-r 

or  simplifying  and  omitting  second-order  terms 

We  have,  however,  from  the  canonical  equations  of  motion  (16)  Chapter  2, 
the  relations 

Mi-J-^     and     ^  =  A/     ^\     etc  (24) 

and  since  the  order  of  differentiation  is  immaterial  can  see  that  pair  by 
pair  all  the  terms  in  the  first  parenthesis  will  cancel,  and  hence  since 
dN/dt  divided  by  the  element  of  volume  dQx  ....  dPm  is  the  rate  of 
change  of  the  density  q,  we  can  write 

for  the  rate  at  which  the  density  of  distribution  is  changing  at  any  point 
in  our  phase  space. 

21.  It  is  immediately  evident  that  this  rate  will  be  zero  if  the  points 
themselves  are  uniformly  distributed,  since  the  quantities  dp/OQi,  etc., 
will  then  all  individually  be  zero.  Furthermore,  since  a  given  system 
cannot  change  its  energy  in  the  course  of  its  motion,  it  is  evident  that 
the  density  of  distribution  will  remain  unchanged  with  time  if  the  phase 
points  are  distributed  as  any  function  of  the  energy,  but  uniformly  in 
each  region  between  the  limits  E  and  E  -\-  dE.    So  that  we  may  write 

1  =  0  (26) 

whenever  the  distribution  is  merely  a  function  of  the  energy  of  the 
systems. 
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Such  ensembles,  in  which  the  density  of  distribution  does  not  change 
with  the  time,  are  said  to  be  in  statistical  equilihrinm.  It  will  be  noted 
that  the  canonical,  and  micro-canonical  distributions,  mentioned  in  Sec- 
tion 19,  would  give  ensembles  which  are  in  statistical  equilibrium.  This 
can  also  be  shown  to  be  true  for  the  surface  distribution  given  in  that 
section.^ 

22.  Equation  (25)  gives  us  a  general  expression  for  the  rate  at  which 
the  density  of  distribution  of  an  ensemble  is  changing  at  a  particular 
point  in  the  phase  space,  and  to  indicate  the  fact  that  we  were  interested 
in  the  change  of  density  at  some  such  chosen  stationary  point  we  have 
used  the  symbols  of  partial  differentiation  dg/dt.  We  can  also,  however, 
follow  the  motion  of  some  selected  phase  point  and  consider  the  change 
of  density  in  the  neighborhood  of  that  moving  point.  To  do  this  we 
merely  have  to  transpose  the  expression  in  brackets  to  the  other  side  of 
the  equation  and  write 

(l)o,.+  (e.t  +  ^-^  + +  <^"^  +  ^"^)=°(^^) 

The  result  gives  us  a  theorem  of  wonderful  simplicity.  Since  the  density 
is  a  function  of  the  time  t  and  the  coordinates  and  momenta  Qi  .  .  .  .  Pm, 
its  total  rate  of  change  with  the  time  will  evidently  be  obtained  by  add- 
ing to  the  partial  rate  of  change  with  the  time,  partials  of  the  density 
with  respect  to  the  other  quantities  multiphed  by  their  rate  of  change 
with  the  time.  Hence  the  full  expression  gives  us  the  total  rate  of 
change  of  the  density  for  any  distribution  of  the  ensemble  provided  we 
follow  a  moving  point  in  the  phase  space.  So  that  using  total  differentials 
we  can  write  in  general 

1  =  0  (28) 

This  result  is  sometimes  called  Liouville's  theorem,  or  the  principle  of  the 
conservation  of  density  in  phase. 

The  possibility  of  obtaining  this  significant  result  is  dependent  on 
our  choice  of  generalized  coordinates  and  momenta  as  the  2  m  axes  for 
our  phase  space.  If  we  had  plotted  our  representative  points  in  a  phase 
space  in  which  the  axes  were  the  coordinates  and  velocities,  we  should 
not  in  general  have  been  able  to  achieve  so  simple  a  result,  since  we 
could  not  then  have  obtained  the  simplification,  arising  from  the  intro- 
duction of  equations   (24),  which  are  dependent  on  the  use  of  the  equa- 

*  See  P.  and  T.  Ehrenfest,  Encykl.  dcr  Math.  Wiss.,  IV  2,  II  Heft  6,  page  29. 
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tions  of  motion  in  the  Hamiltonian  form  involving  the  generalized 
momenta.  This  is  the  reason  why  the  equations  of  motion  in  the  canonical 
Hamiltonian  form  are  so  important  for  statistical  mechanics  and  why  in 
our  future  work  momenta  rather  than  velocities  will  practically  always 
appear  in  the  expressions  we  obtain.  It  is  to  Boltzmann  that  we  owe 
the  clear  recognition  of  the  importance  of  using  momenta  in  statistical 
mechanical  considerations. 

23.  With  the  help  of  equation  (28)  we  can  obtain  a  further  con- 
clusion as  to  the  motion  of  phase  points  of  great  importance.  Suppose 
we  consider  a  small  but  finite  element  of  volume  dQx  .  .  .  .  dPm  in  the 
phase  space.     It  will  evidently  contain  the  number  of  phase  points 

N=QdQ, dPn,  (29) 

Let  us  now  follow  the  motion  of  this  element  containing  these  phase 
points,  through  the  phase  sjxice,  as  the  points  themselves  move.  Since 
no  systems  are  created  or  destroyed  we  shall  evidently  have 

^-^  =  ^dQ,....dP^  +  Qf^    (dQ,....dPm)=0  (30) 

The  first  of  the  two  terms,  however,  is  equal  to  zero  in  accordance  with 
equation   (28),  and  hence  we  obtain  the  important  conclusion 

■^^(dQ^....dP^)=0  (31) 

This  is  called  the  principle  of  the  conservation  of  extension  in  phase. 
In  accordance  with  this  principle,  if  we  draw  a  boundary  surface 
around  the  points  in  a  given  region  of  the  phase  space,  and  follow  the 
motion  of  these  points  as  they  describe  stream  lines  in  the  space,  the 
shape  of  the  bounding  surface  may  change,  but  its  volume  will  not  alter. 

24.  On  the  Properties  of  a  Microcanonical  Ensemble. — Most 
of  our  applications  of  statistical  mechanics  to  molecular  systems  will 
be  based,  as  we  shall  later  show,  on  the  device  of  using  a  microcanonical 
ensemble  of  systems  with  energies  in  the  range  E  to  E  -\-  dE  to  give 
an  idea  of  the  probable  behavior  of  a  single  system  having  an  energy  in 
that  same  range.  We  shall  first  need  to  consider  some  of  the  simpler 
properties  of  such  an  ensemble. 

First  of  all  it  should  be  noted  that  the  considerations  of  Section  21, 
see  equation  (26),  have  shown  us  that  the  uniform  .density  of  phase 
points  in  a  microcanonical  ensemble,  lying  in  the  shell  between  E 
and  E  +  dE,  will  not  alter  with  the  time.  Hence,  if  at  any  time  we 
pick  at  random  some  system  from  the  ensemble,  it  will  have  the  same 
chance  of  coming  from  any  one  of  the  equal  small  regions  dQi . .  .  .  dPm 
into  which  we  could  divide  the  region  between  E  and  E  -{-  dE. 
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To  put  this  result  in  somewhat  more  convenient  language,  we  shall 
define  the  microscopic  state  of  a  system  as  being  determined  by  the  par- 
ticular small  region  dQ^  ....  dPm  in  which  the  representative  point  for 
the  system  happens  to  lie.^  We  may  then  state  that  in  a  micro  canonical 
ensemrhle  all  the  different  microscopic  states  have  the  same  probabilities. 

Two  further  conclusions  can  immediately  be  drawn  from  this  principle. 

In  the  first  place,  in  most  of  our  statistical  considerations  we  shall 
not  be  primarily  interested  in  the  probability  of  only  a  single  microscopic 
state,  but  rather  in  the  combined  probability  of  any  one  of  a  collection 
of  microscopic  states  all  having  some  feature  of  interest.  Thus  to  give 
an  example,  when  we  are  considering  the  vapor  pressure  of  a  crystal 
we  shall  be  interested  in  the  total  probability  of  the  collection  of  all 
possible  states  in  which  a  given  number  of  the  molecules  are  in  the 
crystal,  the  rest  being  in  the  form  of  vapor.  It  is  immediately  evident 
from  the  principle  stated  above  that  in  a  microcanonical  ensemble  the 
combined  probability  of  such  a  collection  of  microscopic  states  is  pro- 
portional to  the  total  volume  in  the  phase  space  corresponding  to  the 
specified  properties. 

In  the  second  place,  we  shall  often  be  interested  in  the  average  of 
some  property  either  for  all  the  systems  in  the  ensemble  or  for  those 
in  a  group  having  some  feature  in  common.  It  is  immediately  evident, 
however,  since  all  the  microscopic  states  have  the  same  probability  that 
in  a  microcanonical  ensemble  an  average  can  be  taken  by  summing  the 
values  of  the  property  in  question  for  all  the  microscopic  states  and 
dividing  by  the  number  of  such  states  in  the  group. 

25.  Application  to  Molecular  Systems. — The  foregoing  simple  re- 
sults provide  practically  all  that  is  necessary  as  a  fundamental  apparatus 
for  the  applications  of  statistical  mechanics,  and  we  are  now  ready  to 
discuss  the  behavior  of  actual  systems  of  molecules. 

Let  us  consider  a  system  of  molecules  enclosed  in  a  suitable  con- 
tainer and  having  the  energy  content  E.  In  general  this  energy  content, 
and  the  components  of  linear  and  angular  momenta  of  the  whole  sys- 
tem, which  we  shall  take  as  zero,  will  be  the  only  dynamical  quantities 
of  which  we  have  any  knowledge,  since  the  motions  of  the  individual 
particles  are  too  complex  for  us  to  follow.^  Hence,  since  we  do  not 
know  the  precise  molecular  configurations  and  velocities  of  our  system, 
we  must  consider  all  possible  phases  that  such  a  system  with  the  known 

*We  shall  usually  find  it  convenient  to  regard  the  regions  dQi  .  .  .  dPm  as 
small  but  finite,  so  that  there  will  be  only  a  finite  number  of  microscopic  states. 

^  The  fact  that  they  are  beyond  the  powers  of  our  observational  methods  is  a 
secondary  matter.  The  methods  would  be  just  as  necessary  in  the  case  of  a 
system  of  equal  complexity  even  if  composed  of  particles  of  size  large  enough 
to  observe  easily. 
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energy  content  E  might  assume,  and  to  do  this  we  can  use  a  micro- 
canonical  ensemble  of  systems  similar  to  the  one  of  interest  with  their 
representative  points  distributed  uniformly  throughout  a  shell  which  lies 
between  the  surfaces  determined  in  the  2  7n-dimensional  space  by  E  and 
E  +  dE. 

26.  Introduction  of  the  Ergodic  Hypothesis. — So  far  we  have 
proceeded  without  the  introduction  of  any  hypotheses,  except  that  the 
system  obeys  the  laws  of  dynamics  in  the  canonical  form.  To  go  fur- 
ther, however,  we  must  now  introduce  the  assumption  that  such  a  micro- 
canonical  ensemble  of  systems  having  energies  between  E  and  E  -f-  dE 
gives  a  fair  representation  of  the  different  phases  which  will  be  assumed 
in  the  course  of  time  by  a  single  system  having  an  energy  content  between 
E  and  E  -f-  dE.  To  state  this  more  specifically  we  shall  have  to  assume 
that  the  results  obtained  by  selecting  a  collection  of  systems  at  random 
from  such  an  ensemble  are  practically  the  same  as  would  be  obtained  by 
considering  a  given  system  at  random  times. 

A  certain  empirical  justification  for  this  assumption  is  given  by  the 
fact  that  our  later  developments  based  on  its  use  are  found  to  meet  the 
test  of  experiment. 

The  theoretical  justification  for  the  assumption  is  more  difficult  and 
less  certain,  though  nevertheless  plausible. 

27.  In  the  first  place  it  should  be  noted  that  the  maintenance  of 
uniform  density  in  a  microcanonical  ensemble,  guaranteed  by  equation 
(26),  means  that  the  representative  points  for  the  systems  have  no 
tendency  to  crowd  into  any  particular  region  of  the  phase  space.  If 
such  a  crowding  did  take  place,  the  representative  point  for  any  given 
system  would  at  any  arbitrary  time  be  more  likely  to  lie  in  such  a  region 
of  compression  than  elsewhere  in  the  phase  space.  In  the  absence  of 
such  crowding,  however,  we  can  definitely  state  that  a  given  system 
taken  at  any  raudom  time  has  an  equal  chance  of  being  in  any  one  of 
the  different  microscopic  states  through  ivHich  it  actually  passes  in  the 
course  of  its  history. 

This  definitely  valid  conclusion  is  of  course,  however,  not  identical 
with  the  statement  that  the  system  has  an  equal  chance  of  being  in  any 
one  of  all  the  microscopic  states  which  lie  in  the  phase  space  between  the 
surfaces  determined  by  E  and  E  -f  dE,  as  is  true  for  a  microcanonical 
ensemble,  since  a  given  system  may  not  in  the  course  of  its  complete 
history  be  able  to  reach  all  the  regions  between  E  and  E  -f-  dE. 

28.  To  meet  this  the  ergodic  hypothesis,  as  it  was  called  by  Boltzmann, 
or  the  principle  of  continuity  of  path,  as  it  was  called  by  Maxwell, 
makes  the  definite  assumption  that  a  system  of  molecules  actually  does 
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pass  through  all  possible  microscopic  states  consonant  with  its  energy 
content  before  completing  its  cycle  of  motions.  This  hypothesis,  if  true, 
would  of  course  make  the  selection  of  random  samples  from  a  micro- 
canonical  ensemble  absolutely  identical  with  the  consideration  of  a  given 
system  at  random  times  and  hence  would  justify  the  further  procedure 
we  wish  to  undertake. 

The  ergodic  hypothesis  in  its  simple  form,  however,  is  not  even 
plausible.  The  reentrant  orbits  of  astronomy  have  made  us  quite  familiar 
with  the  possibility  of  an  infinite  number  of  closed  paths  for  a  given 
dynamical  system  all  of  them  having  the  same  energy  content.  Indeed 
no  single  example  of  a  dynamical  system  of  more  than  one  degree  of 
freedom  has  ever  been  discovered  in  which  the  motion  carries  the  sys- 
tem through  all  possible  phases  consonant  with  its  energy  content. 

29.  It  seems  evident  that  we  must  reject  the  ergodic  hypothesis  in 
its  simple  form  as  not  having  sufficient  plausibility  and  investigate  other 
similar  assumptions  which  would  lead  to  equivalent  results.  We  may 
consider  three  further  simple  possibilities. 

a.  It  might  be  assumed  that  for  a  system  of  molecules  with  a  given 
energy  content  there  are  a  number  of  closed  paths  possible,  but  that  one 
of  them  is  infinitely  longer  than  all  the  others  put  together,  so  that  we 
could  neglect  the  possibility  of  the  system  being  on  any  path  but  the 
longest,  and  the  microscopic  states  lying  along  this  path  would  include 
practically  all  the  microscopic  states  in  the  region  between  E  and  E  -\-  dE. 
Such  a  possibility  was  apparently  first  suggested  by  Jeans,  but  we  must 
agree  with  him  in  rejecting  it,  as  being  no  more  plausible  than  the  assump- 
tion of  a  single  closed  path  itself. 

b.  A  second  possibility  would  be  to  assume  that  for  a  given  energy 
content  a  system  of  molecules  can  move  on  any  one  of  a  number  of 
closed  paths,  but  cannot  of  itself  get  from  one  of  these  closed  paths 
to  another.  It  might,  however,  be  fortuitously  removed  from  one  path 
to  another  by  external  agencies  in  such  a  way  as  to  have  an  equal  prob- 
ability of  being  in  any  microscopic  state  belonging  to  the  -correspond- 
ing microcanonical  ensemble.  This  point  of  view  has  also  been  sug- 
gested by  Jeans,  who  has  further  indicated  the  plausibility  of  regarding 
one  of  the  molecules  themselves  as  a  fortuitous  external  disturbing 
agency  which  acts  on  the  system  which  is  now  regarded  as  composed  of 
all  the  other  molecules  present.  This  assumption  would  of  course  lead 
to  the  desired   results. 

c.  A  third  possibility,  which  appears  to  the  present  writer  most  satis- 
factory for  the  purposes  at  hand,  is  to  grant  that  there  may  indeed  be 
more  than  one  closed  path  for  the  motion  of  a  system  of  molecules, 
with  a  given  energy  content,  perhaps  indeed  a  great  many  such  paths. 
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Lying  along  each  closed  path  will  be  a  succession  of  microscopic  states, 
through  which  the  system  must  pass  before  it  rearrives  at  the  starting 
point ;  this,  however,  it  will  inevitably  do,  since  we  shall  take  the  regions 
dQi  ....  dPm  which  specify  the  different  microscopic  states  as  small  but 
nevertheless  finite,  so  that  there  will  not  be  an  infinite  number  of  possible 
states. 

Let  us  now  consider  the  motion  of  a  given  system  when  left  to 
itself.  Its  representative  point  will  have  to  lie  on  one  of  the  possible 
paths  which  correspond  to  its  energy  content.  We  cannot  say  on  which 
path,  but,  as  we  saw  above,  we  can  say  that  at  any  random  time  it  has 
an  equal  chance  of  being  in  any  one  of  the  microscopic  states  that  lie  along 
this  particular  path. 

So  far  we  have  introduced  no  assumption.  Let  us  now  consider, 
however,  the  probable  nature  of  the  different  closed  paths.  It  is  evi- 
dent that  for  an  actual  system  containing  many  molecules  most  of  these 
closed  paths  would  have  to  be  very  long.  Consider  for  example  a  gas 
composed  of  many  molecules  moving  back  and  forth  between  the  walls 
of  its  container.  Even  if  the  molecules  collided  only  with  these  walls 
and  not  among  themselves,  it  is  evident  that  we  should  have  to  wait 
an  exceedingly  long  time  before  we  should  again  find  every  individual 
molecule  back  at  its  starting  point,  and  when  we  allow  for  collisions 
among  the  molecules,  the  time  is  enormously  longer  yet.  Hence  at  least 
the  vast  majority  of  the  paths  will  contain  an  enormous  number  of 
microscopic  states  taken  out  from  those  that  lie  in  the  total  shell  be- 
tween E  and  E  -\-  dE. 

It  hence  seems  reasonable,  since  so  many  states  have  to  be  taken,  to 
assume,  by  and  large,  that  the  microscopic  states  lying  along  the  particular 
closed  path  on  which  the  system  of  interest  happens  to  lie  are  them- 
selves a  fair  sample  of  the  different  kinds  of  states  present  in  the  whole 
region  of  phase  space  between  the  limits  E  and  E -{- dE.  If  this  be 
admitted,  any  large  collection  of  samples  chosen  at  random  from  the 
whole  microcanonical  ensemble  between  E  and  E  -\-  dE  will  have  prac- 
tically the  same  properties  as  the  collection  obtained  by  observing  the 
given  system  at  random  times,  and  the  use  of  the  microcanonical  ensemble 
as  giving  a  fair  representation  of  the  succession  of  states  arrived  at  in 
time  by  a  single  system  is  justified. 

To  illustrate  this  conclusion  by  a  specific  example,  we  shall  presently 
find  that  the  vast  majority  of  all  the  states  in  a  microcanonical  ensemble 
have  distributions  of  molecular  velocities  that  do  not  differ  appreciably 
from  Maxwell's  law.  We  shall  then  feel  justified  in  believing  that  this  is 
also  probably  true  for  the  vast  majority  of  the  states  lying  along  the 
particular  path  on  which  a  given  system  is  to  be  found. 

We  shall  henceforth  proceed  on  the  assumption  that  a  microcanonical 
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ensemble  may  be  taken  as  giving  a  fair  representation  of  the  successive 
states  of  an  actual  molecular  system. 

30.  Nature  of  the  Systems  to  be  Considered. — Our  further  con- 
siderations will  usually  deal  especially  with  systems  containing  a  large 
number  of  identical  elements  of  one  or  more  kinds,  for  example  mole- 
cules, atoms,  electrons,  modes  of  vibration  or  what-not,  and  for  short 
we  shall  use  for  the  present  the  general  name  molecules  as  applying  to 
such  elements. 

Evidently  the  total  number  of  degrees  of  freedom  of  such  a  system 
will  be  equal  to  the  sum  of  the  degrees  of  freedom  belonging  to  the 
individual  molecules  of  which  the  system  is  composed.  Furthermore,  we 
can  consider  the  coordinates  and  momenta  used  for  describing  the  sys- 
tem as  a  whole  as  being  the  total  collection  of  coordinates  and  momenta 
which  belong  to  the  individual  molecules.  Thus,  for  example,  if  we 
have  a  system  composed  of  N  molecules  each  having  n  degrees  of  free- 
dom, we  can  identify  the  coordinates  and  momenta  Qi  .  .  .  .Pm  belonging 
to  the  system  as  a  whole  with  the  A'^  sets  of  variables  gi  .  .  .  .  qnpi  •  .  .  .  pn 
belonging  to  the  N  molecules. 

In  the  case  of  systems  containing  many  identical  molecules,  it  should 
be  noted  that  we  must  think  of  these  molecules  as  being  separately 
identifiable  so  that  a  specification  of  the  microscopic  state  of  the  system 
gives  us  the  location  and  motion  of  each  individual  molecule,  and  a  new 
microscopic  state  would  result  even  if  all  we  did  was  merely  to  swap 
some  pair  of  molecules  with  regard  to  their  coordinates  and  momenta. 
It  is  evident,  however,  that  for  the  purposes  of  predicting  the  further 
gross  behavior  of  the  system,  such  a  swap  would  be  of  no  importance. 
This  is  one  of  the  reasons  why  we  shall  usually  be  interested  in  a  col- 
lection of  microscopic  states  rather  than  in  a  single  microscopic  state. 

31.  y-Weight  and  ^M-Weight. — In  concluding  this  chapter  we  may 
mention  a  nomenclature  that  we  shall  often  find  very  convenient. 

As  already  stated,  in  most  of  our  statistical  considerations  we  shall 
not  be  primarily  interested  in  the  probability  of  only  a  single  microscopic 
state,  but  rather  in  the  total  probability  of  a  collection  of  microscopic 
states,  all  having  some  common  feature  of  interest.  It  is  evident  that 
the  total  probability  of  such  a  collection  of  microscopic  states  will  be 
proportional  to  the  total  number  of  microscopic  states  involved,  since 
they  are  all  of  equal  probability.  Each  microscopic  state,  however,  corre- 
sponds to  a  particular  one  of  the  equal  regions  dOx  ....  dPm  into  which 
we  have  divided  our  y-space.  Hence  the  total  probability  of  our  collec- 
tion of  microscopic  states  will  be  proportional  to  the  total  volume  in 
the  y-space  which  corresponds  to  the  collection. 
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Adopting  the  convenient  language  of  Ehrenfest  and  Trkal,  we  shall 
call  the  collection  of  microscopic  states  of  the  system  in  which  we  are 
interested  the  state  of  the  system,  and  the  corresponding  volume  in  the 
phase  space  the  y-weight  of  the  state.  We  shall  represent  the  y-weight 
of  a  state  by  the  symbol  y  and  the  probability  of  the  state  will  be  pro- 
portional to  this  quantity. 

Similarly  we  shall  define  the  /^-weight  /<  of  a  given  state  of  a  par- 
ticular molecule  as  the  volume  of  the  ^^-space  in  which  the  representative 
point  for  the  molecule  falls.  In  the  case  of  a  dilute  system,  we  shall 
find  it  possible  to  calculate  the  y-weight  of  a  given  state  by  considering 
the  ;«- weights  of  the  molecules  of  which  the  system  is  composed. 


Chapter  4 

Derivation  of  the  Maxwell-Boltzmann  Distribution 

Law 

32.  Specification  of  a  Given  State  of  the  System. — Our  first 
application  of  the  principles  discussed  in  the  preceding  chapter  will  be 
to  determine,  for  a  physical-chemical  system  which  has  come  to  equi- 
librium, how  many  molecules  of  each  particular  kind  have  values  for 
their  coordinates  and  momenta  which  fall  in  any  particular  range.  The 
result  which  we  shall  obtain  is  called  the  Maxwell-Boltzmann  distribu- 
tion law  for  the  coordinates  and  momenta  of  a  system  of  molecules. 

Let  A,  B,  C,  etc.,  be  the  numbers  of  molecules  of  each  of  the  various 
kinds  which  go  to  make  up  the  complete  system,  a,  b,  c,  etc.,  being  the 
number  of  degrees  of  freedom  respectively  for  each  kind  of  molecule. 

In  order  to  specify  the  condition  of  any  one  of  these  molecules,  for 
example  of  the  kind  A,  we  shall  make  use  of  a  ^M-space  of  2a-dimensions 
corresponding  to  the  2a  coordinates  and  momenta  belonging  to  each  one 
of  these  molecules.  The  position  of  a  representative  point  in  this  phase 
space  will  evidently  give  the  condition  of  the  molecule. 

We  shall  divide  this  ^ti-space  into  equal  infinitesimal  regions  having 
the  volume 

doA  =  dqi.  .  .  .  dqadpi dpa  (32) 

and  number  them  off  by  the  designations  lA,  2A,  ZA,  etc.,  in  order  to 
distinguish  them  one  from  another.  In  general  the  first  three  coordinates 
qi,  ^2,  Qs  will  be  taken  as  the  Cartesian  coordinates  x,  y,  r  which  locate 
the  center  of  gravity  of  the  molecule,  and  the  first  three  momenta  pi,  p2  ps 
will  then  be  the  components  of  linear  momentum  nix,  my  and  mi  for  the 
motion  of  the  molecule  as  a  whole.  The  remaining  coordinates  and 
momenta  will  refer  to  the  internal  configuration  and  motions  of  the 
molecule,  for  example  its  rotation.  Hence  the  statement  that  the  phase- 
point  for  a  given  molecule  falls  in  a  particular  region  do^  will  give  us 
the  location  and  motion  of  the  molecule  in  space,  as  well  as  its  internal 
configurations  and  motions. 

We  can  now  specify  any  given  state  of  the  system  by  giving  the 
numbers  of  molecules  Ai,  A2,  A^, ....  Bx,  B2,  B3, .  .  . .  Ci,  C2,  C3,  etc.,  of 
the  different  kinds  which  have  coordinates  and  momenta  lying  in  the 

44 
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different  infinitesimal  regions  Nos.  \A,  2A,  ZA, ....  \B,  2B,  SB, .... 
IC,  2C,  SC,  etc.,  into  which  we  have  divided  the  //-spaces  for  the  different 
kinds  of  molecules. 

33.  Probability  of  a  Given  State  of  the  System. — We  must  now 
consider  the  probability  of  any  given  state  of  the  system,  since  we  shall 
presently  inquire  as  to  what  state  has  the  greatest  probability. 

To  do  this  we  must  first  note  that  a  state  of  the  system,  specified  in 
the  manner  just  described,  will  evidently  correspond  to  many  different 
microscopic  states,  since  if  we  had  any  exemplar  of  the  system  in  a 
microscopic  state  with  the  above  distribution  of  molecules,  we  could  evi- 
dently obtain  a  new  microscopic  state  with  the  same  distribution  of 
molecules  by  swapping  any  pair  of  molecules  of  the  same  kind  that  had 
their  coordinates  and  momenta  in  different  regions  of  the  /i-space.  Thus 
if  we  had  2  molecules  of  the  kind  A  in  each  of  the  regions  Nos.  lA 
and  2A,  we  could  rearrange  these  4  molecules  in  6  different  ways  and 
each  time  have  two  molecules  in  each  of  the  regions.  Hence  when  we 
consider  the  great  number  of  molecules  of  each  kind  available  for  such 
swaps,  it  is  evident  that  there  will  ordinarily  be  an  enormous  number 
of  individual  microscopic  states  corresponding  to  the  state  of  the  system. 

34.  We  are  now  ready  to  investigate  the  probability  of  such  a  state 
of  the  system. 

In  the  last  chapter  we  saw  that  the  probability  of  any  state  of  the 
system  could  be  taken  proportional  to  the  volume  of  the  y-space  corre- 
sponding to  that  state,  or  as  we  shall  say  to  the  y-weight  of  the  state. 

If  now  we  consider  any  single  microscopic  state  that  belongs  to  the 
total  collection  of  microscopic  states  that  go  to  make  up  the  state  in  which 
we  are  interested,  it  is  evident  that  each  molecule  will  have  to  fall  in  a 
specified  region  in  its  own  //-space,  so  that  the  representative  point  for 
the  whole  system  will  have  to  fall  in  a  perfectly  definite  region  in  the 
/-space,  having  a  hyper-volume 

do  =  (do^y  {dosY  (docY ....  (33) 

which  is  obtained  by  multiplying  together  the  volumes  of  //-space  assigned 
to  each  of  the  A  -{-  B  -\-  C  -\-,  etc.,  molecules  in  the  system.  The  result 
will  perhaps  be  more  easily  understood,  if  it  is  appreciated  that  each 
molecule  contributes  its  own  axes  in  the  //-space  to  the  total  number  of 
axes  necessary  for  the  y-space,  and  thus  do  as  given  by  equation  (33),  is 
the  product  of  infinitesimal  distances  along  each  of  these  2a A  -\-  2b B  -j- 
2cC-\-,  etc.,  axes.  Specifying  a  given  region  do  in  the  7-space  thus  tells 
the  values  of  the  coordinates  and  momenta  for  each  separate  molecule. 
Since  the  7- weight  of  any  state  of  the  system  is  the  volume  of  7-space 
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corresponding  to  it,  the  7-weight  of  the  above  particular  microscopic  state 
of  the  system  will  be 

y  =  da  (34) 

35.  We  are,  however,  interested  in  the  probability  or  7-weight  of 
the  total  collection  of  microscopic  states  which  correspond  to  the  state 
of  the  system  in  which  Ax,  A^,  A3,  etc.,  Bi,  B^,  Bs,  etc.,  Ci,  d,  C9,  etc., 
etc.,  molecules  have  coordinates  and  momenta  lying  in  the  regions  Nos. 
IA^2A,  3A,  etc.,  IB,  2B,  3B,  etc.,  IC,  2C,  3C,  etc.,  etc.  Evidently  the 
total  y-weight  of  this  collection  can  be  found  by  multiplying  the  7-weight 
of"  one  microscopic  state,  do,  by  the  total  number  of  microscopic  states 
in  the  collection.     We  obtain  for  the  7- weight  of  the  state 


M  [B  \c 

^  ~  \Ar  [A,  Ijis. . .  .\^  \B2  [Bs . . . .  [C,  [C,  [Cs 


do     (35) 


where  the  factor  which  multiplies  da  is  the  number  of  ways  in  which 
we  can  rearrange  all  the  molecules  of  the  system  so  as  to  get  microscopic 
states  belonging  to  the  collection.  To  show  this  latter  point,  we  note  that 
the  total  number  of  ways  in  which  we  could  arrange  A  molecules  in  a 

linear  series  would  evidently  be  A(A—l)(A—2) (2)(1)  =  \A. 

This  total  number  of  permutations,  however,  is  greater  than  the  num- 
ber in  which  we  are  interested,  since  some  of  the  different  permuta- 
tions would  result  merely  from  changes  in  the  order  of  the  molecules  in 
some  particular  region  doj^  and  this  would  not  give  us  a  new  microscopic 
state.     Hence  the  total  number  of  permutations  |y£  is  equal  to  the  number 

of  permutations  in  which  we  are  interested  multiplied  by  |^i  |/J2_  Ma 

which  is  the  number  of  permutations  that  do  not  involve  the  transfer 
of  a  molecule  from  one  region  to  another.  Hence  dividing  \A  by 
\Ax  \A2  1^3 ....  we  obtain  the  number  of  ways  of  rearranging  the  mole- 
cules of  kind  A  to  obtain  different  microscopic  states,  and  by  extension  to 
include  all  the  different  kinds  of  molecules  obtain  the  expression  given. 

36.  We  shall  almost  always  find  it  easiest  to  work  with  the  logarithm 
of  the  y-weight,  and  for  purposes  of  simplification  shall  introduce  for 
each  of  the  factorial  numbers  occurring  in  equation  (35)  the  Sterling 
approximation 

\N_=^2^(^Y  (36) 

This  formula  is  valid  only  for  large  numbers,  so  that  the  possibility  of 
using  it  in  equation  (35)  is  dependent  on  the  assumption  that  the  num- 
ber of  molecules  Ai  A2  A3 .  . . .  Bi  B2  B3 .  .  . .  etc.,  in  each  of  the  regions 
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into  which  we  have  divided  the  /(-spaces  for  the  different  kinds  of  mole- 
cules are  themselves  large.  Since  we  have,  however,  an  enormous  num- 
ber of  molecules  of  each  kind,  this  can  be  made  true  for  the  regions  of 
interest  in  the  //-space  by  not  taking  the  infinitesimals  do^,  do^,  etc.,  too 
small.  This  point  will  be  more  fully  discussed  in  a  later  section. 
Taking  the  logarithm  of  (36)  we  obtain 


log  I A^  =  A^  log  A^  —  A^  +  log  VZ^ 


(37) 


where  on  the  basis  of  our  assumptions  the  last  term  can  be  neglected  in 
comparison  with  the  other  two. 

37.    Taking  the  logarithm  of  the  expression  given  by  equation   (35) 
and  substituting  (37)  we  obtain 

log  Y=       A  log  A     4-  5  log  B     +  C  log  C     -f 
—A  ^B  —C  — 

—  Ai  log  Ai  —  A2  log  A2  —  As  log  As  — 

—  B,  log  B,  —  B;  log  B2  —  Bs  log  Bs  — 

—  Ci  log  Cx  —  C2  log  C2  —  C3  log  C3  — 


+  A 

+  A2 

+  A, 

+ 

+  ^1 

+  B2 

+  B, 

+ 

+  Cx 

+  C2 

+  C3 

+ 

-|-  log  do 

It  should  be  noted,  however,  that  the  total  number  of  molecules  of  any 
kind  must  be  equal  to  the  sum  of  the  numbers  in  the  individual  regions, 
so  that  we  have  for  example 

A  =  A,  +  A.  +  A, 


Hence  we  may  simplify  the  above  to  obtain  as  our  final  expression  for 
the  logarithm  of  the  y- weight  of  the  specified  state 


log  Y  =  ^  log  ^  +  ^  log  5  +  C  log  C  + 
—  2^i  log  A 

i 

-j-  log  dG 


^Bj  log  Bj 

3 


I'Cfc  log  Cic  — 

k 


(38) 


where  the  summations  2  are  to  be  taken  over  all  the  infinitesimal  regions 
i,  j,  k,  etc.,  for  the  /(-spaces  corresponding  to  each  kind  of  molecule. 

38.  State  of  Maximum  Probability  and  Derivation  of  the 
Maxwell-Boltzmann  Distribution  Law. — We  shall  now  be  specially  in- 
terested  in  the   state  of   maximum   probability  when  the  system   has  a 


48  STATISTICAL  MECHANICS  [Ch.  4 

given  volume,  energy  content,  and  composition.  The  condition  for  maxi- 
mum probability  will  evidently  be 

—  ^  log  7  =  2(log  Ai  +  \)dAi  +  2(log  Bj+\)dBi-{-....  =  0  (39) 
i  i 

where  the  variation  d  is  brought  about  by  a  change  in  the  numbers  of 
molecules  having  coordinates  and  momenta  in  the  different  infinitesimal 
regions.  This  variation,  however,  cannot  be  carried  out  entirely  arbi- 
trarily, since  we  desire  to  know  the  most  probable  condition  for  a  system 
having  a  given  volume,  energy  and  composition,  and  hence  must  main- 
tain the  volume,  energy  and  number  of  molecules  of  each  kind  in  the 
system  constant,  when  the  variation  is  performed. 

39.  The  maintenance  of  constant  volume  is  satisfied  if  our  variation 
does  not  involve  the  introduction  of  molecules  into  regions  corresponding 
to  values  of  the  coordinates  which  lie  outside  of  the  container,  and  this 
will  obviously  be  taken  care  of  since  we  shall  consider  only  the  same 
infinitessimal  regions  Nos.  \A,  2A  .  .  .  .  IB,  2B  .  .  .  .  etc.,  in  our  original 
and  varied  states  of  the  system. 

The  maintenance  of  constant  energy  can  be  achieved  if  we  introduce 
the  additional  equation 

dE  =  '^EidAi  +  ^ejSBj  +  llskdCu  +  .  .  .  =  0  (40) 

i  j  k 

where  Si  for  example  is  the  rate  of  change  in  the  energy  of  the  system 
per  molecule  of  the  kind  A  introduced  into  the  i'th  infinitesimal  region, 
as  given  by  the  equation  ^ 

_  dE 
^'~dAi  (41) 

Finally  the  maintenance  of  constant  composition  will  be  taken  care 
of  if  we  introduce  the  further  equations 

dA  =  ^dAi  =  0 

i 

SB  =  ^dBj  =  0  .42) 

dC  =  ^dCjc  =  0 

k 

etc. 

40.  We  now  have  the  set  of  simultaneous  equations  (39),  (40),  and 
(42),  all  of  which  must  be  satisfied.  The  equations  can  be  most  easily 
treated  by  the  well-known  method  of  undetermined  multipliers. 

^  For  dilute  systems  ei  will  be  the  actual  energy  of  a  molecule  in  region  i. 
For  concentrated  systems  Ei  can  depend  on  the  mutual  aaergy  of  the  molecules. 


38-42]  DERIVATION  OF  MAXWELL-BOLTZMANN  LAW       49 

Multiplying  equation  (40)  by  an  undetermined  quantity  //  and  equa- 
tions (42),  by  undetermined  quantities  -i^,  A^,  X^,  etc.,  and  adding  to 
(39)  we  obtain, 

2(log  Ai-\-l^  ^iEi^X^)dAi 
i 

+  2(log  S, ■  +  1  +  /'£;•  +  ^b)SBj  +  ....  =  0  (43) 

i 

and  this  equation  must  be  true  for  arbitrary  variations  dAi,  dBj,  etc.,  so 
that  we  obtain  as  the  solution 

log  Ai  +  I -^  insi -\-  Xa  =  0 

log  5,-  +  1  +  fiSj  -^Xs  =  0  (44) 

etc. 

It  will  be  noted  that  /<  is  the  same  quantity  for  all  the  different  kinds 
of  molecules  in  the  system,  but  that  there  is  no  relation  between  the 
quantities  X^,  A^,  Xf.,  etc.,  for  the  different  kinds  of  molecules. 

Equations  (44)  may  now  be  solved  for  the  numbers  of  molecules  Ai, 
Bj,  etc.,  and  written  in  the  more  convenient  form 

Ai  =  a  A  e—f^U 

Bj  =  aBe-'''i  (45) 

etc. 

where   a^,   as,  etc.,   are   themselves   constants    for   each   kind   of   mole- 
cule, corresponding  to  the  constants  X^,  Xg,  etc.,  in  equations  (44). 

41.  These  equations  are  as  a  matter  of  fact  an  expression  of  the 
M axiwll-B oltsmann  distribution  law  for  the  molecules  of  a  system  when 
it  is  in  its  state  of  ma^-imum  probability.  They  show  that  the  number 
of  molecules  of  any  kind,  for  example  A,  having  coordinates  and  momenta 
falling  in  f'th  infinitesimal  region  of  the  phase  space  appropriate  for 
plotting  the  condition  of  molecules  of  that  kind,  is  determined  by  the 
energy  increase  of  the  system  Si,  which  is  brought  about  by  the  intro- 
duction of  a  molecule  into  that  region.  Since  the  total  energy  of  the 
system  is  limited,  it  is  evident  that  /^  is  a  positive  quantity,  and  that 
the  number  of  molecules  per  infinitesimal  region  in  the  phase  space  de- 
creases as  £  increases. 

42.  Critique  of  the  Method  of  Derivation. — Before  proceeding  to 
the  expression  of  these  equations  in  the  more  familiar  forms  ordinarily 
used  in  making  applications  of  the  Maxwell-Boltzmann  law,  we  must 
consider  the  effect  resulting  from  the  introduction  of  the  Sterling  ap- 
proximation in  the  course  of  the  derivation. 


50  STATISTICAL  MECHANICS  [Ch.  4 

Since  the  Sterling  formula  for  factorial  N  becomes  rapidly  more 
nearly  true  as  N  is  increased,  we  might  expect  equations  (45)  to  be  sub- 
stantially true  for  those  regions  in  the  phase  space  where  the  number 
of  molecules  is  large,  that  is,  where  e  is  small.  On  the  other  hand,  for 
large  values  of  e  we  might  expect  the  equations  to  fail.  This  is,  more- 
over, indicated  by  the  form  of  equations  (45)  themselves,  since  for  large 
enough  values  of  £  we  should  evidently  arrive  at  fractional  numbers  of 
molecules. 

To  show  that  equations  (45)  can  really  be  taken  as  correct  for  those 
regions  in  the  //-space  where  the  numbers  of  molecules  are  large,  it  will 
be  sufficient  to  consider  a  simple  system  containing  N  molecules  all  of  the 
same  kind. 

In  analogy  with  equation  (35)  in  Section  35  we  can  write  for  the 
y-weight  of  a  particular  distribution  of  the  N  molecules  of  the  system 

\N 

—  —  do  (46) 


^  -  |A^  [iV^   |7V_3 . . 
or 

log  7  =  log  [^—  2  log  |A[i_  +  log  do  (47) 

where  Ni  is  the  number  of  molecules  in  the  i'th  region  of  the  ^u-space. 
For  the  variation  in  y-weight  with  variations  in  the  arrangement  we 
shall  then  obtain,  considering  only  regions  which  do  contain  large  numbers 
of  molecules, 

5  log  7  =  -  2        °J—  dNi=  0  (48) 

and  this  is  set  equal  to  zero  as  the  condition  for  a  maximum.  In  carry- 
ing out  variations  of  the  arrangement,  however,  we  shall  wish  to  keep 
the  total  energy  E  constant  and  the  number  of  molecules  A^"  in  the  sys- 
tem constant,  in  accordance  with  the  equations 

^s.SNi  =  0  (49) 

t 

and 

^dNi=0  (50) 

i 

Treating  equations  (48),  (49)  and  (50)  as  before  by  the  method  of 
undetermined  multipliers,  we  easily  obtain  the  result 

-^  +  ,e.  +  X  =  0  (51) 
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which  will  be  true  even  for  regions  containing  a  small  number  of  mole- 
cules   Ni. 

43.    For  regions  which  do  contain  large  numbers  of  molecules,  how- 
ever, we  can  use  the  Sterling  approximation  for  factorial  A'',  giving  us 


d 


-  log  V~2^i  (^Y'+  tiei-\-X  =  0 
+  log  N i -\-  liBi -\-  X  =  0 


dN 
or 

1 

2Ni 

and  since  Ni  is  large  the  first  term  can  be  neglected,  and  the  ec^uation 
solved  in  the  form 

Ni  =  ae  -f^'i  (52) 

which  agrees  with  equations  (45). 

We  thus  see  that  the  Maxwell-Boltzmann  distribution  law  does  hold 
for  all  the  regions  which  contain  large  numbers  of  molecules,  and  this 
is  all  that  is  necessary  for  our  further  purposes,  since  we  can  neglect 
the  effect  of  molecules  in  the  remaining  regions. 

44.  Probability  of  States  Which  Deviate  from  the  Maxwell- 
Boltzmann  Distribution. — The  Maxwell-Boltzmann  law  gives  us  the 
distribution  of  the  coordinates  and  momenta  of  the  molecules  when  the 
system  is  in  its  state  of  maximum  probability.  We  must  now  investigate 
the  probability  of  states  which  deviate  from  the  Maxwell-Boltzmann  dis- 
tribution. We  shall  find  that  the  Maxwell-Boltzmann  distribution  is 
not  only  the  most  probable  but  very  much  more  probable  than  distribu- 
tions which  differ  even  moderately  from  it. 

For  simplicity  let  us  take  a  system  containing  A''  molecules  of  only  a 
single  kind.  Restricting  ourselves  to  the  consideration  of  regions  con- 
taining large  numbers  of  molecules,  we  can  write  in  accordance  with 
equation  (38)  of  Section  37  for  the  7-weight  of  any  distribution 

log  7  =  A^  log  A^  —  2AAf  log  A^i -f  log  (/o  (53) 

i 

and  for  the  y-weight  of  a  neighboring  different  distribution 

log  {y-{-Ay)  =NlogN—^(Ni  +  ANi)\og  (Ni  +  ANi)  -f  log  Jo     (54) 

i 

where  Ay  is  the  change  in  /-weight  and  ANi  is  the  change  in  the  num- 
ber of  molecules  in  i'th  region  when  we  pass  from  the  original  to  the 
neighboring  state.     Subtracting  (53)   from  (54),  we  obtain 
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log  L±Ay  ^  _  2(Ar,  +  ANi)  log  {Ni  +  zliVi)  +  2  iV^  log  Ni 
7  i  i 

=  - 2  JA^i  log  (l +^^  +  zlTVaog^l +^  j  +  zliVi  log  TV  j. 

Taking  now  the  changes  ANi  small  enough  so  that  we  can  expand  the 
logarithm  of  (1  -|-  ANi/Ni)  into  a  series  and  neglect  higher  terms,  we 
obtain 

+  ANi  1^  -....^+AN,  log  iVi] 

:=  —  2  ^  iV,  ("4^')  —  2zliV,  —  ^ANi  log  iV,  . 
i  V  ^Vi  /        i  < 

Let  us  now  take  the  original  distribution  corresponding  to  equa- 
tion (53)  as  the  state  of  maximum  probability  agreeing  with  the  Maxwell- 
Boltzmann  law.  We  can  then  substitute  for  Ni  in  the  last  term  the 
expression  given  by  equation  (52),  and  write 

log  ^      =  —  i  2A^i  (^\  —  2zlA^i  —  log  a^ANi  +  fiXsiANi 

/max.  I  \   Ni     /  i  i  i 

where  ymax.  is  the  probability  of  the  Maxwell-Boltzmann  state,  and  y 
that  of  the  state  which  deviates  therefrom.  It  is  immediately  evident, 
however,  that  the  last  three  terms  in  this  equation  must  be  zero,  since 
we  must  preserve  the  total  number  of  molecules  and  total  energy  of  the 
system  constant.     We  obtain  as  our  final  expression 

log^      =-iXN,(fiy  (55) 

/max.  {  \    -'^i    / 

and  note  as  should  evidently  be  true  that  any  change  from  the  Maxwell- 
Boltzmann  distribution  will  lead  to  a  decrease  in  probability  dependent  on 
the  second  power  of  the  deviations, 

45.  It  is  at  once  evident  from  equation  (55)  that  the  /-weight  is 
very  sensitive  to  deviations  from  the  Maxwell-Boltzmann  distribution  law, 
whenever  large  numbers  of  molecules  are  involved.  Consider  for  ex- 
ample that  we  had  two  similar  regions  in  the  w-space  each  containing"  in 
the  most  probable  state  as  small  a  number  as  1000  molecules,  then  the 
transfer  of  one  per  cent  of  these  molecules  from  one  of  these  regions  to 
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the  other  would  alone  reduce  the  probability  of  the  configuration  by  ten 
per  cent.  Deviations  involving  larger  numbers  of  molecules  would  have 
an  even  greater  effect.  Thus  if  we  had  a  cubic  centimeter  of  gas  con- 
taining 2  X  10^^  molecules,  and  considered  a  state  with  an  average 
deviation 

4^-0.01 

Ni 

for  all  regions,  we  should  obtain  for  the  ratio  of  probabilities 

so  that  a  state  with  an  average  deviation  of  one  per  cent  from  the 
normal  number  of  molecules  in  all  regions  of  the  //-space  would  have 
almost  no  chance  of  occurring. 

We  may  conclude  that  in  a  system  containing  a  large  number  of 
molecules  appreciable  deviations  from  the  Maxwell-Boltzmann  distribu- 
tion will  have  a  very  small  probability  of  occurrence.  Hence  we  may 
take  the  Maxwell-Boltzmann  distribution  law  as  substantially  represent- 
ing the  state  of  the  molecules  in  a  system  which  has  come  to  equilibrium, 
and  may  proceed  to  deduce  from  it  the  properties  of  such  systems. 

46.  Value  of  the  Constant  a  Occurring  in  the  Maxwell-Boltz- 
mann Distribution  Law. — We  shall  now  reftirn  to  the  equations  which 
we  obtained  in  Section  40  for  the  Maxwell-Boltzmann  distribution  law 

Ai  =  a^e—i^U 

Bj  =  aBe-''',  (56) 

etc. 
or  in  general 

Ni  =  ae-'''i  (57) 

and  show  the  methods  of  transformation  into  the  more  familiar  expres- 
sions which  are  convenient  for  actual  application  to  physical-chemical 
problems.  To  do  this  we  shall  first  need  to  obtain  more  rational  ex- 
pressions for  the  constants  a  and  pi  which  have  so  far  been  undetermined. 

47.  The  quantity  Ni  occurring  in  equation  (57)  is  the  number  of 
molecules  at  equilibrium  whose  coordinates  and  momenta  have  values 
falling  in  the  i'th.  one  of  the  equal  infinitesimal  regions 

do  =  dqi dpn  (58) 

into  which  we  have  divided  the  2  n-dimensional  ^a-space  used  for  plotting 
the  values  of  these  coordinates  and  momenta.     It  will  obviously  be  natural 
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to  take  this  number  proportional  to  the  total  number  of  molecules  N  oi 
the  kind  in  question  and  to  the  volume  do  of  the  infinitesimal  region. 
This  we  can  do  by  writing 

a  =  NCdq,  .  . . .  dpn  (59) 

where  C  is  a  new  constant.  Substituting  into  (57)  and  changing  the 
notation  to  that  corresponding  to  the  differential  form,  we  can  then  write 
the  Maxwell-Boltzmann  distribution  law  in  the  more  familiar  way 

dN  =  NCe-i"'  dq^ dpn  (60) 

where  dN  is  the  number  of  molecules  whose  coordinates  and  momenta 
have  values  falling  in  an  infinitesimal  range  dq^ ....  dpn,  the  quantity  e 
being  the  energy  increase  in  the  system  per  molecule  introduced  into  this 
region. 

48.  This  quantity  £  is  evidently  a  function  of  the  coordinates  and 
momenta  qx .  .  .  pn,  and  when  the  form  of  this  function  is  known  it  then 
becomes  possible  to  determine  the  value  of  the  constant  C  by  integration. 
To  do  this  we  may  write 

(  dN  =   r (  NCe-^'dqx dpn 

and  if  this  integration  is  carried  out  over  all  the  regions  of  the  jU-space, 
we  evidently  obtain,  since  A^"  and  C  are  constants, 

N  =  NC   (  ....   f  e-f''dqx....dpn 

or 

1 
C  = 


(....(e-i''dqy....dpn  (61) 

so  that  C  can  be  evaluated  whenever  the  indicated  integration  can  be 
performed.  Since  the  dependence  of  e  on  the  g's  and  p's  will  be  differ- 
ent for  different  kinds  of  molecules,  it  is  evident  that  the  constant  C 
will  also  depend  on  the  kind  of  molecule,  corresponding  to  the  lack  of 
relation  between  the  different  constants  a^,  a^,  etc.,  already  noted  in 
Section  40. 

49.  Value  of  the  Constant  fi  Occurring  in  the  Maxwell-Boltz- 
mann Distribution  Law. — To  obtain  the  value  of  the  constant  ^  occurring 
in  the  Maxwell-Boltzmann  distribution  law,  it  is  necessary  to  apply  the 
law  to  the  calculation  of  some  actual  macroscopic  property  of  the  sys- 
tem.    To  do  this  we  shall  consider  that  a  part  of  the  system  consists 
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of  a  container  of  volume  V,  enclosing  N  molecules  of  a  monatomic  gas 
so  dilute  that  the  forces  between  the  molecules  of  the  gas  can  be  neg- 
lected ;  in  other  words  A^  molecules  of  a  perfect  monatomic  gas.  We 
shall  then  calculate  the  pressure  of  this  gas  and  find  it  related  to  the 
quantity  (jl.  On  the  other  hand  the  pressure  of  such  a  gas  is  related  by 
the  gas  laws  to  the  temperature  T  of  the  whole  system,  so  that  we  shall 
finally  obtain  what  proves  to  be  a  very  simple  relation  between  n  and  T. 

Consider  a  gas  consisting  of  N  non-attracting  point  particles  of 
mass  m.  For  such  a  system  the  generalized  coordinates  may  be  taken  as 
the  Cartesian  coordinates  x,  y  and  z  locating  the  particles,  and  the  corre- 
sponding momenta  as  the  components  of  linear  momentum  mx,  iny  and 
1112,  so  that  the  Maxwell-Boltzmann  distribution  law  as  given  by  equa- 
tion  (60)  can  be  written  in  the  form 

dN  =i  NCe~i^^  dxdydsmdxmdymds  (62) 

where  dN  is  the  number  of  particles  having  coordinates  falling  in  the 
region  dx  dy  dz  and  velocities  in  the  range  dx  dy  dz.  Or  integrating  over 
the  whole  volume  of  the  container  V ,  we  obtain  for  the  total  number  of 
molecules  which  have  velocities  in  the  range  in  question 

dN  =  NCVm^e-f'  dxdydz  (63) 

and  denoting  constant  terms  by  the  single  symbol  A  may  write 

dN  =  NAe-f'  dxdydz  (64) 

50.  This  result  can  immediately  be  used  to  obtain  an  expression  for 
the  pressure  of  the  gas.  Consider  an  element  of  surface  dS,  for  con- 
venience perpendicular  to  the  X-direction,  and  let  the  pressure  of  the 
gas  be  P  so  that  the  force  exerted  on  the  surface  dS  by  particles  which 
are  coming  up  to  it  along  the  Z-direction  and  then  rebounding  is  PdS. 

Now  by  equation  (64)  the  total  number  of  particles  in  the  container 
having  a  component  of  velocity  in  the  A'-direction  between  x  and  x  -f-  dx 
is  evidently 

+  CO     +00      X  +  dx 


)       ^      (   NAe-i"^  dxdydz 


But  xdS  gives  us  the  volume  which  contains  the  number  of  particles  hav- 
ing the  velocity  x  which  will  reach  the  surface  dS  per  second.  Hence  the 
fraction  of  the  above  molecules  which  impinge  on  dS  per  second  will  be 
xdS/V  and  the  actual  number  will  be 


+  CO     +  CO     .r  +  dx 


^f      f      (  NAe-f"' dxdydz 
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Finally,  the  momentum  which  these  particles  give  to  the  surface,  allow- 
ing for  the  effect  of  the  rebound,  will  be 

+  00     +00      X  +  dx 

2mx  ^vr    \     )        \   NAe-"^  dxdydz 


00       -  OO         X 


The  force  PdS,  however,  which  the  gas  exerts  on  the  element  of  surface 
will  be  the  total  momentum  given  to  the  element  per  second  by  the  re- 
bounding molecules,  and  this  can  be  calculated  by  integrating  the  above 
expression  for  all  possible  values  of  x  between  zero  and  infinity.  We 
obtain 


+  00         +00         +00 


PdS  =?j^   f      C      C  NAe-i^'  mx^dxdydz 


00      -co       0 


Cancelling  dS,  transposing  V,  and  noting  that  symmetry  considerations 
will  permit  us  to  change  the  limits  of  integration,  we  obtain 


+  00         +00         +00 


PV=    C      r      ij    NAe-i'^mx-'dxdyds  (65) 


-  00         -  00         -  00 


51.  The  value  of  this  integral  can,  however,  be  obtained  by  a  simple 
artifice.  Returning  to  equation  (64)  and  substituting  for  the  energy  e 
its  evident  value  in  terms  of  the  velocities  fn'/2{x~ -\- y"^ -\- z^),  we  can 
write 


+  00         +00         +00 

N 

00         -   CO         -  00 


=    (      (       (  NAe-i'T^'^'  +  *'  +  -'=) dxdydz 


or  performing  the  integration  with  respect  to  x  by  parts,  we  obtain 

+  00+00  i-rr-j-00 

r      (   NAe-i'^-^^-^'  +  y'-^ -">  xdydz 


N  = 


-  00         -  00 

+  00        +00       +00 


X  —  00 


+    )       )       )    NAe-i^^^"^^  +  y^  +  ^"-^  jumx^dxdydz 


00        -  00        -  00 


Owing  to  the  negative  sign  of  the  exponent,  it  is  evident,  however,  that 
on  the  substitution  of  the  limits  the  first  term  on  the  right-hand  side  of 
this  equation  becomes  zero,  and  we  obtain 
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+  00         +00         +00 

N=     \        \        C  NAe-f' jum.v^dxdyds  (66) 


Comparing  now  with  equation  (65),  we  see  that  we  have  finally 
obtained  the  simple  result 

iV 

Pf'^y  (67) 

In  accordance  with  the  gas  laws,  however,  we  know  experimentally  that 
the  pressure  volume  product  for  such  a  gas  will  be  given  by  the  equation 

PV  =  NkT  (68) 

where  the  Boltzmann  constant  k  is  the  ordinary  gas  constant  R  divided 
by  the  number  of  molecules  in  one  mol.  of  gas.  We  thus  obtain  from 
(67)  and  (68)  the  desired  relation 

f'  =  W  (69) 

52.  This  equation  relates  the  quantity  /li  to  the  temperature  T  of  the 
perfect  monatomic  gas  which  we  assumed  as  part  of  our  equilibrium  sys- 
tem. The  quantity  has  as  we  saw,  however,  in  Section  40  the  same  value 
for  all  the  kinds  of  molecules  in  the  system,  and  furthermore,  the  macro- 
scopic quantity  T  is  obviously  the  same  throughout  such  an  equilibrium 
system,  so  that  the  relation  is  a  general  one.  It  should  also  be  noted  that 
the  inclusion  of  a  container  holding  a  perfect  gas  as  part  of  the  systerr, 
is  analogous  to  the  experimental  use  of  a  gas  thermometer  for  measur- 
ing the  temperature  of  the  system,  and  will  not  in  any  harmful  way 
affect  the  behavior  of  the  system. 

53.  Expression  of  the  Maxwell-Boltzmann  Distribution  Law  in 
Different  Forms. — Having  thus  obtained  the  familiar  expressions  for 
the  values  of  the  constants  C  and  /ti,  we  can  now  write  the  Maxwell- 
Boltzmann  distribution  law  in  the  usual  forms  which  prove  convenient 
for  applications.  Substituting  the  new  value  for  fi  into  equation  (60), 
we  may  write  as  our  first  fundamental  form  of  the  law 

£ 

dN  =  NCe  ~kf  dq^...dpn  (70) 

where  in  accordance  with  equation  (61),  the  constant  C  has  the  value 

1 


C  = 


I    \   e     kT  dqi...dpn  (71' 
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the  integration  to  be  taken  over  all  regions  of  the  ^a-space.  Combining 
(70)  and  (71)  we  may  eliminate  C  and  obtain  the  distribution  law  in  the 
form 

£ 

,-,  Ne~'kfdqi dp„ 

^^  =  -r r  -± 

I   I   e     kTdqi dpn  (72) 

Still  another  form  of  the  distribution  law  which  is  sometimes  use- 
ful is  obtained  by  the  introduction  of  a  constant  ip  defined  by  the  equation 

NC  =  eW  (73) 

which  leads  to  the  expression 

■A  — e 

dN  —  e    kT    dqi dpn  (74) 

54.  Method  of  Obtaining  Averages  from  the  Maxwell-Boltz- 
mann  Distribution  Law. — Since  the  Maxwell-Boltzmann  expression  for 
dN  gives  the  number  of  molecules  having  coordinates  and  momenta  in 
the  range  dqi  ....  dpn,  it  is  evident  that  the  average  value  of  any  prop- 
erty P  of  the  molecules  which  is  dependent  on  the  coordinates  and 
momenta  can  be  calculated  from  the  equation 

P,,.  =  -i  fpdN  =   C  ....    Cce~if  Pdq^ .  .  .  dPn  (75) 

where  in  performing  the  integration  the  quantity  P  is  to  be  regarded  as  a 
function  of  the  coordinates  and  momenta  q^  . .  .  .  pn,  and  the  integration  is 
to  be  taken  over  the  whole  or  a  part  of  the  /i-space,  according  as  the 
average  is  desired  for  the  whole  or  a  part  of  the  molecules. 

55.  As  an  illustration  of  the  usefulness  of  this  method  of  averaging, 
we  can  use  it  to  show  the  possibility  of  obtaining  a  simple  expression 
for  the  temperature  coefficient  of  the  quantity  C,  occurring  in  the  Max- 
well-Boltzmann expression,  even  in  the  absence  of  a  knowledge  as  to  its 
absolute  value.  Returning  to  the  expression  for  C  given  by  equation  (71) 
and  differentiating  with  respect  to  the  temperature  we  obtain 

d£_ J 

Substituting  equation  (71)  itself,  however,  this  may  be  rewritten  in  the 
form 

?5f  =  -J---J^'""'A?5''«'---''^" 
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and  in  accordance  with  equation  (75)  this  gives  us 

d  log  C e 

dT    ~~'Fn 

where  e  is  the  average  value  of  the  energy  e. 


(76) 


56.  Change  of  Variables. — In  the  application  of  the  Maxwell- 
Boltzmann  law  to  further  problems,  it  is  sometimes  necessary  to  make  a 
change  in  the  variables  qi .  . . .  pn.  To  do  this  we  can  make  use  of  a 
theorem  due  to  Jacobi  which  connects  the  expressions  for  an  infinitesimal 
area  in  different  sets  of  coordinates.  Consider  a  change  from  the  variables 
X  and  y  to  w  an  v^  where 


u  =  u{x,  y)    and   v  ^  v{.v,  y) 
We  can  then  substitute 


or 


u 

V 

J 

■V, 

y 

J 

£j 

1 

u. 

V 

dxdy  =  dudv 


dudv  =  dxdy 


(77) 


(78) 


(79) 


where  the  Jacobian  /  is  defined  as  the  functional  determinant 

du     du 

u,v  _   ^■^'  ^y 

x,y    ~     dv^    dv 
dx    dy 

The  method  can  be   extended  by   substituting   in   this  manner    for   co- 
ordinates a  pair  at  a  time. 

We  are  now  ready  to  undertake  the  application  of  the  Maxwell- 
Boltzmann  distribution  law  to  the  study  of  actual  physical-chemical 
problems. 


Chapter  5 

Application  of  the  Maxwell-Boltzmann  Distribu- 
tion Law  to  Problems  Involving  Molecular 
Velocities 

57.  Introduction. — It  often  happens  that  we  are  not  primarily  inter- 
ested in  the  values  of  all  the  coordinates  and  momenta  that  determine 
the  condition  of  a  molecule,  but  only  in  the  three  coordinates  which  give 
its  location  in  space  and  the  three  components  of  momentum  which  are 
related  to  its  motion  as  a  whole.  If  we  concentrate  our  attention  on 
these  three  degrees  of  freedom,  we  can  obtain  Maxwell's  law  for  the 
distribution  of  velocities  among  the  molecules  of  a  system,  which  may 
be  regarded  as  a  special  case  of  the  more  general  distribution  law,  and  is 
indeed  the  one  which  was  first  deduced. 

In  the  present  chapter,  we  shall  first  obtain  Maxwell's  law  for  the 
distribution  of  velocities  in  a  variety  of  convenient  forms,  and  shall  then 
use  it  to  investigate  the  pressure  volume  relations  of  gases,  and  to  deter- 
mine the  number  of  collisions  which  occur  between  the  molecules  of  a 
gas. 

58.  Maxwell's  Law  as  a  Special  Case  of  the  General  Distribu- 
tion Law. — Consider  a  system  containing  N  similar  molecules  each  hav- 
ing n  degrees  of  freedom.  Then  the  Maxwell-Boltzmann  distribution 
law  for  the  molecules  of  the  system  can  be  written  in  the  general  form 

dN  =  NCe~Tf  dqy....dpn  (80) 

Let  us  now  assume,  however,  that  the  energy  increase  of  the  system  e, 
per  molecule  introduced  into  the  various  regions  of  the  /^-space,  can  be 
taken  as  the  sum  of  three  terms 

E  =  e{x,y,z)^'l  (x^  -i-f-\-  i^)  -f  £i  (81 ) 

where  the  first  term  gives  the  potential  energy  of  the  whole  molecule 
as  a  function  of  its  coordinates  of  position  x,  y  and  z,  the  second  term 
its  kinetic  energy  as  a  whole  as  determined  by  its  mass  in  and  compo- 
nents of  velocity  x,  y  and  s,  and  the  last  term  gives  the  remaining  energy 

60 
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which  it  contributes  to  the  system  as  a  function  solely  of  the  remaining 
coordinates  and  momenta  qi  . .  . .  qnpi  .  . .  .  pn.  Whenever  the  system  is  of 
such  a  nature  that  this  separation  can  be  made  the  considerations  which 
are  to  follow  will  be  justified.  It  is  evident  that  this  will  be  true  for 
dilute  gases,  and  perhaps  closely  true  for  a  wider  variety  of  other  systems 
than  has  sometimes  been  thought  the  case,  provided  always  of  course  that 
quantum  restrictions  do  not  enter  the  problem. 

Since  the  momenta  corresponding  to  the  three  coordinates  x,  y  and  2 
will  now  evidently  have  the  values  mx,  my  and  ms,  we  can  now  rewrite  the 
Maxwell-Boltzmann  distribution  law  in  the  form 

__£(_  c(.r.  y.  ■-)  +    m/2  (.r-  +  .v' +  i-) 

dN  =  M(Ce     kT  dqt.  .dqn  dpi.  .dpnm^)e  kT  dxdydzdxdyds    (82) 

where  for  convenience  we  have  collected  inside  the  brackets  quantities 
which  are  not  dependent  on  the  positional  coordinates  or  velocities  of  the 
molecules. 

As  we  are  now  interested  for  our  present  purposes  in  all  the  mole- 
cules which  have  positions  and  velocities  falling  in  the  range  dx  dy  dz 
dx  dy  dz,  without  reference  to  the  value  of  other  variables  determining 
the  condition  of  the  molecule,  we  may  integrate  over  all  possible  values 
of  the  coordinates  and  momenta  which  occur  inside  the  brackets,  and 
thus  obtain  the  simple  expression 

t{x,y,s)  +    m/2  U' +  y' +  i") 

dN  =  NAe  kT  dxdydzdxdydz  (83) 

where  dN  is  the  number  of  molecules  falling  in  the  range  dx  dy  dz 
dx  dy  dz  and  A  represents  the  quantity  resulting  from  the  integration. 
The  equation  has,  we  see,  exactly  the  same  form  for  all  different  kinds 
of  molecules,  monatomic  or  polyatomic,  and  indeed  applies  quite  rigorously 
to  any  kind  of  molecules,  whenever  the  quantity  e  can  be  separated  into 
three  terms  as  discussed  above. 

59.  From  the  form  of  equation  (83)  it  is  immediately  evident  that 
the  number  of  molecules  in  any  region  of  the  container  dx  dy  dz  is  de- 
termined by  the  potential  energy  of  a  molecule  in  that  region,  and  this 
can  be  used,  for  example,  to  determine  the  variation  in  the  density  of  a 
gas  with  height  in  a  gravitational  field.  Furthermore,  from  the  quadratic 
and  symmetrical  form  of  the  kinetic  energy  we  see  that  the  velocities  of 
the  molecules  will  be  distributed  uniformly  in  all  directions. 

60.  Let  us  now  turn  our  attention  primarily  to  this  distribution  of 
velocities.  To  do  this  we  may  take  the  potential  energy  of  the  molecules 
as  zero  independent  of  their  location  and  integrate  over  all  the  regions  in 
the  total  volume  of  the  container  V  and  obtain 
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m  (i-'  +  y'  +  i') 
JA^  =  NVAe  ~n        kf  dxdydz  (84) 

for  the  number  of  molecules  having  velocities  in  the  range  dx  dy  dz 
and  this  is  Maxwell's  distribution  law.  It  can,  however,  be  expressed 
in  rather  more  convenient  forms. 

Let  us  first  introduce  polar  coordinates  with  the  help  of  the  evident 
expressions  (see  Section  56  in  the  last  chapter) 

C2  —  i-2   _^  ^2  _|.  _^2  (85) 

and 

dxdydz  =  c^  s'm^d'&d(j>dc  (86) 

where  c  is  the  total  velocity  and  ^  and  <^  the  angles  determining  the 
direction  of  motion.     Substituting  in  (84)  we  obtain 

dN  =  NVAe~Wrc^  smMM(f>dc  (87) 

and  integrating  over  all  values  of  the  longitudinal  angle  <f>  from  0  to 
271,  and  over  all  values  of  the  latitudinal  angle  <f)  from  0  to  ti  we  obtain 

tnc' 

dN  =  4nNVAe~lkf  c^dc  (88) 

The  value  of  the  constant  A  occurring  in  these  equations  can  now, 
however,  easily  be  determined  by  integration  over  all  possible  values  of  c 
from  zero  to  infinity.    We  evidently  obtain 


e~'2kf  c^dc  =  AnNVA  <  jW- 


Snk^T^ 


ntr 


0 

or 


61.    By  substitution  of  equation   (89)    in   (84),    (87)   and   (88),  we 
can  now  obtain  Maxwell's  law  in  the  very  useful  forms 

^^\      e"2         kf         dxdydz  (90) 

2^ ]      e~~2kfc-  s\nMMcf>dc  (91 ) 

2^)      e~lkfc'dc  (92) 

Finally  one  further  form  of  Maxwell's  law  will  be  found  useful  where 
the  range  is  expressed  in  terms  of  the  kinetic  energy  of  the  molecules 
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fi  =  -2-  (93) 

Introducing  in  (92)  we  obtain 

2  «      /- 

dN  =  N—^ g~IrVe  (/e  (94) 

62.  Average  Values  of  the  Velocity. — In  a  number  of  applications 
it  is  convenient  to  have  expressions  for  different  average  velocities  which 
can  be  obtained  from  equation   (92). 

We  easily  obtain  the  mean  velocity  by  multiplying  (92)  by  c,  integrat- 
ing over  all  values  and  dividing  by  N,  in  accordance  with  the  principles 
of  averaging  discussed  in  Section  54  of  the  last  chapter.  We  obtain  the 
result 

Q  =  Jll  =  14,500  JIhHi  (95) 

yl  7im  I'M  sec.  ^     ^ 

For  the  root-mean-square-velocity,  which  is  the  square  root  of  the  mean 
value  of  c^  we  obtain 

G  =  J^  =  15,800  V5  ^^  (96) 

T   m  yi  M  sec. 

and  for  the  most  probable  velocity  we  easily  obtain  by  the  conditions  for  a 
maximum 

W  =  V^^  =  12,900  VS  ^^  (97) 

J   m  jj  M  sec. 

It  will  be  noticed  that  these  different  average  velocities  do  not  differ 
greatly  from  one  another  owing  to  the  sharp  convergence  of  the  velocities 
around  the  most  probable  value.  In  the  numerical  expressions  for  the 
averages,  the  temperature  is  in  degrees  centigrade  absolute,  and  the  mass 
is  the  molecular  weight  of  the  substance.^ 

63.  The  Virial. — Our  first  application  of  this  knowledge  as  to  the 
distribution  of  molecular  velocities  will  be  to  obtain  a  general  expression 
for  the  pressure  of  a  gas.  As  a  preliminary  to  this  we  shall  first  find  it 
necessary  to  discuss  an  expression  called  the  virial  which  was  originally 
introduced  by  Clausius. 

Let  X,  y  and  s  be  the  coordinates  of  the  center  of  gravity  of  a  molecule 
of  mass  w  and  X,  Y ,  Z  the  components  of  force  acting  on  it.  Then  the 
equations  of  motion  will  be  of  the  form 

*  Numerical  values  taken  from  Dushman,  "High  Vacuum,"  Schenectady,  1922. 
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w^=Xetc.  (98) 


We  may  evidently  write,  however, 

d_ 
dt 


(m.r^]=m(^y+m.r^  (99) 

and  substituting  (98)  and  rearranging  obtain 

Let  us  now  integrate  this  equation  over  a  time  interval  between  t-^  and  fz, 
and  divide  by  the  interval  so  as  to  obtain  an  average  value  for  the  expres- 
sion on  the  left  hand  side.    We  obtain 


m 


0'+^=ir^.{(-4),r(»41,j    (>°^) 


where  averages  are  indicated  by  the  bars.  If  we  take  the  time  interval 
long,  however,  it  is  evident  that  the  quantities  on  the  left  hand  side  of 
the  equation  will  approach  steady  values,  while  the  quantity  on  the  right 
hand  side  will  approach  zero  since  infinite  values  of  the  terms  in  the 
brackets  do  not  occur.  Hence  averaged  over  a  long  time  interval,  or 
averaged  at  a  given  time  over  all  the  molecules  of  the  system  we  can  write 


and  adding  the  similar  expressions  for  the  other  axes,  summing  for  all 
the  molecules  and  dividing  by  2  we  obtain  the  remarkable  theorem 

2i^2  +  2|  (.^  +  ^H-iZ)  =0  (102) 

where  the  first  term  is  the  total  kinetic  energy  of  the  molecules  and  the 
second  is  the  virial  of  Clausius. 

64.    Application  of  the  Virial  to  Calculate  the  Pressure  of  a  Gas. 

In  order  to  apply  this  equation  to  determine  the  pressure  of  a  gas 
we  note  that  the  terms  in  the  virial  will  arise  from  two  causes, — the 
forces  between  the  molecules  and  the  walls  which  lead  to  the  pressure, 
and  the  forces  between  the  molecules  themselves. 

To  obtain  the  first  or  external  contribution,  let  us  for  simplicity  assume 
a  rectangular  container  having  the  sides  Ox,  Oy  and  Or.  Since  the  area 
of  the  two  faces  perpendicular  to  the  A'-axis  is  ys  and  the  x  coordinates 
are  0  and  x,  the  contribution  of   this  pair  of   faces   will  evidently  be 
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p 

—  y  xys,  so  that  the  total  contribution  of  the  walls  to  the  virial  will  be 

Ve  =  —  ^PV  (103) 

To  obtain  the  contribution  of  the  internal  forces  in  a  convenient  form, 
let  us  represent  by  /(r)  the  force  (taken  positive  for  repulsion)  acting 
between  two  molecules  at  the  distance  r  from  each  other,  having  the 
respective  positions  .r,  y,  s  and  x' ,  y' ,  z' .  The  force  acting  on  the  first 
molecule  evidently  has  the  components 

X  =  /(r)^^,  F  =  /(r)^^     and     Z  =  j{r)'-^ 

and  hence  contributes  to  the  virial  the  term 

t^[x{x-x-)  +3,(3,-3/)  +.^(.^_y); 

while  for  the  second  molecule  the  contribution  will  be 

2r 

or  adding  we  obtain 


{  x'{x'  —  x)  +  y  {y  —  y)  +  z'  {z'  —  z)  \ 


t^[^x'-xY-^r{y'-yy^{z'-zy\ 

so  that  for  the  total  contribution  of  the  internal  forces  to  the  virial,  we 
have 

Fi=rJ22r/(r)  (104) 

where  the  summation  is  to  be  taken  over  all  pairs  of  molecules. 

65.  Substituting  the  two  contributions  given  by  (103)  and  (104)  into 
(102)  and  rearranging  we  obtain  the  beautiful  relation 

PF  =  i  2  ;;;?  +  i  22  r  /(O  (lOS) 

The  first  term  on  the  right-hand  side  of  this  equation  has,  however, 
in  accordance  with  equation  (96)  the  value  NkT,  so  that  we  obtain  the 
gas  laws  in  the  form 

PF  =  A^^T  +  i  22  r/(r)  (106) 

and  at  once  see  that  for  a  gas  dilute  enough  so  that  we  can  neglect  the 
internal  forces  between  the  molecules  the  laws  for  a  perfect  gas  will  hold. 
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66.  Equation  (106)  can  now  be  made  the  starting  point  for  a  num- 
ber of  important  investigations  which  unfortunately  cannot  be  included 
in  the  present  work.  Thus  the  calculation  of  the  internal  part  of  the 
virial  due  to  the  Coulomb  forces  between  charged  ions  in  an  electrolytic 
solution  has  led  to  the  Milner-Debye  theory  of  interionic  attraction,  and 
by  making  suitable  assumptions  as  to  the  forces  between  the  molecules 
of  a  gas  it  has  been  possible  to  arrive  at  reasonable  expressions  for  the 
equations  of  state  of  compressed  gases  and  liquids. 

67.  As  a  single  illustration  of  the  latter  possibility  we  may  show  the 
method  of  obtaining  Van  der  Waals'  equation,  introducing  the  assump- 
tions that  he  makes,  only  as  a  possible  first  approximation.  To  carry 
out  the  calculation,  let  us  divide  the  term  ^  1,^rf{r)  into  two  terms 
^  22r/i(r)  which  arises  from  the  repulsion  of  molecules  when  they  are 
close  together  and  |  22  r/aCr)  which  arises  from  the  residual  attraction 
when  they  are  far  apart. 

With  regard  to  the  first  term  we  shall  assume  that  the  molecules 
behave  as  rigid  spheres  so  that  the  force  of  repulsion  always  acts  at  the 
same  distance  r  when  the  two  molecules  collide.  The  number  of  col- 
lisions, however,  can  evidently  be  taken  proportional  to  Vt/V  and  the 
force  which  is  dependent  on  the  relative  velocity  as  also  proportional  to 
VT  (see  Section  70),  so  that  the  first  term  can  be  written  as 

^22rA(r)=^  (107) 

where  5  is  a  proportionality  constant. 

With  regard  to  the  second  term,  we  shall  assume  with  Van  der  Waals 
that  the  forces  of  attraction  can  be  regarded  as  balancing  out  in  the 
interior  of  the  fluid,  and  need  only  be  considered  at  the  boundary  where 
they  will  manifest  themselves  as  a  cohesion  pressure  Pi,  so  that  using 
the  same  considerations  that  led  to  equation  (103),  we  can  write 

i22r/2(r)=— PiF  (108) 

Or  if  we  assume  again  with  Van  der  Waals  that  this  cohesion  pressure 
is  inversely  proportional  to  V^,  since  it  is  dependent  both  on  the  number 
of  molecules  at  the  boundary  and  the  number  within,  which  are  doing 
the  attracting,  we  can  write 

i22r/3(r)=-^  (109) 

Substituting  (107)  and   (109)  into  (106),  we  obtain 

PV  =  NkT  +  ^-^ 
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or  referred  to  one  mol  of  gas 

(^+^)^  =  ^r(n-^)  (110) 

where  Z^  is  a  new  constant,  which  can  be  shown  to  be  related  to  the  volume 
of  the  molecules  themselves. 

For  small  values  of  the  ratio  h/V,  this  can  be  written  in  the  Van  der 
Waals'  form 


(^  +  f) 


{V~h)=RT  (111) 

The  result  is  of  course  not  entirely  satisfactory,  because  of  the  assump- 


tions. 


68.  Still  making  the  hypothesis  that  the  repulsive  forces  may  be  re- 
garded as  arising  from  the  collision  of  rigid  spheres,  but  making  more 
general  assumptions  as  to  the  residual  attractive  forces  and  not  neo-lect- 
ing  their  action  within  the  interior  of  the  fluid,  Reinganum  ^  obtains  the 
equation 


c 


PV  =  RT{x+'f)-ii^  (112) 

where  the  exponential  term  arises  from  the  action  of  the  attractive  forces 
within  the  interior  of  the  fluid  and  the  form  of  ^{T)  depends  on  the  law 
of  attraction. 

69.  The  Number  of  Collisions  between  Molecules. — If  we  treat 
the  molecules  of  a  gas  as  rigid  elastic  spheres,  it  is  possible  to  calculate 
the  number  of  collisions  between  them  which  occur  per  second.  Althouo-h 
such  a  treatment  must  be  regarded  as  a  first  approximation,  it  never- 
theless seems  to  be  a  reasonably  good  one,  and  the  results  will  be  of 
considerable  later  interest,  especially  in  connection  with  the  rate  of  chem- 
ical reactions. 

Let  us  examine  the  chances  of  collision  between  two  molecules  having 
the  masses  Wi  and  mo,  and  diameters  Oi  and  02,  enclosed  in  a  container 
having  a  volume  of  one  cubic  centimeter.  At  the  instant  of  contact  the 
center  of  the  second  molecule  will  evidently  have  to  lie  on  a  sphere  drawn 
about  the  center  of  the  first  molecule,  with  a  radius  O12,  which  is  the  mean 

'Reinganum,  Ann.  der  Phys.,  6,  533  (1901).  For  another  equation  of  state, 
with  extensive  empirical  tests,  see  the  work  of  Keyes, — Proc.  Nat.  Acad  3  323 
(1917)  ;  11,  38  (1925)  ;  Journ.  Amer.  Chem.  See,  41,  589  (1919)  ;  43,  1452  '(1921)  • 
46,  1584  (1924)  ;  Amer.  Soc.  Refrigerat.  Eng.  Journ.,  7,  371  (1921);  Journ  Math' 
Phys.,  Mass.  Inst.  Tech.,  1,  211   (1922). 


68  STATISTICAL  MECHANICS  [Ch.  5 

of  the  two  diameters  Oi  and  02.  Consider  especially  now  collisions  in  which 
the  angle  between  the  direction  of  the  relative  velocity  V  and  the  line  of 
centers  at  the  instant  of  contact  lies  in  the  range  between  •&  and  ■&  -{•  dd-. 
Under  these  circumstances  the  center  of  the  second  molecule  at  the  instant 
of  contact  will  evidently  have  to  lie  on  a  ring  of  area  27iOi2^sm§d'&  cut 
out  of  the  sphere  of  radius  O12.  And  since  the  component  relative 
velocity,  parallel  to  the  line  of  centers  and  hence  perpendicular  to  the 
surface  of  this  ring  is  Fcosi?,  it  is  evident  that  the  center  of  the  sec- 
ond molecule  must  lie  in  a  volume  27iOx2^s\r\&dd^Vzos'^dt,  if  such  a  col- 
lision is  to  occur  within  a  short  time  interval  dt.  Neglecting  the  space 
occupied  by  the  molecules  themselves,  however,  the  chance  that  the  center 
of  a  molecule  will  lie  in  this  small  volume  is  equal  to  the  volume  itself 
divided  by  the  total  volume  unity,  so  that  we  may  write  for  the  chance 
of  such  collisions  per  second  2noi2^Vsmd^cos'&d'&. 

Taking  now  the  total  number  of  molecules  per  cubic  centimeter  of 
the  two  kinds  as  being  Ni  and  No,  we  can  use  the  equation  (90)  to  give 
us  the  number  of  each  kind  having  component  velocities  in  the  ranges 
of  interest  from  Xi  to  Xi  +  di'x,  ±2  to  X2  +  dx2,  etc.,  and  hence  can  write 
for  the  total  number  of  collisions  per  second  of  the  kind  described 

~^^  )    e 2kf V  sini>  cos§dx,dy,dz,dx,dy,dc2d»    (113) 

where  V  is  determined  by  the  values  of  Xi,  yi,  Zx,  X2,  y2  and  22. 

To  treat  this  expression  let  us  introduce  new  variables  u,  v,  w  and  a, 
/5,  y  defined  by  the  equations 

Wii^i  +  m2X2     ^  .  .       .  / 1 1  /I  ^ 

u  =  =-^  etc.  a  =  X2  —  Xi  etc.  ( 1 14) 

nil  -j-  W2 

so  that  u,  V  and  %v  are  the  components  of  the  velocity  of  the  center  of 
gravity  of  the  two  molecules  and  a,  ^  and  y  the  components  of  their  rela- 
tive velocity  V.     Then  if  we  put 

c^  ^  u" -\- v" -\- zv"         and         F^  =  a^  +  ^^  +  7^  (115) 

we  can  write  in  accordance  with  a  well-known  relation 

mxcx^  -f  yius.}  =  (m,  +  ^n.^c^  +     '^f''^      V^  (116) 

as  can  be  verified  by  the  substitution  of  equations  (114).  Also  in  accord- 
ance with  Jacobi's  theorem  (see  Section  56  in  the  last  chapter),  we  may 
substitutute  for  the  area  dx\  dx2  by  using  the  equation 


/ 


M,    a 
-I'l,  X2 


dxidx2  =  duda 
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and  since  the  Jacobian  is  easily  found  to  have  the  value  1,  we  see  that  by- 
considering  all  three  pairs  of  coordinates  we  can  write 

dxidyidsidxzdyzdsz  =  du  dv  dw  da  d^  dy  (117) 

Substituting  in  equation  (113)  we  obtain 

(■mm      \  %         ^'"■'  "*"  '"^^^  "^ +  '    '  ' 
^Jf^,j2  )    e 2kT  '     ""      V  sill!?  zosMu  dv  dzv  da  djS  dy  d9   (118) 

On  integrating  du  dv  dw  over  all  possible  directions  in  space  for  the 
velocity  of  the  center  of  gravity  c,  we  evidently  obtain  the  volume  Anc^dc 
and  similarly  integrating  da  d^  dy  over  all  possible  directions  for  the 
relative  velocity  V  we  obtain  AnV^dV  so  that  expression  (118)  may  now 
be  written  in  the  form 

dZ  =  32n^Y,N.o,.'  ( _nhnh_ \  %        <'"- +  "'->^  +  ;;i7T^ '"  c'V'  sint?  cos^^c  dV  dd     (1 19) 

And  finally,  since  we  shall  not  usually  be  interested  in  the  velocity  of  the 
center  of  gravity  we  may  integrate  this  over  all  values  of  c  from  zero  to 
infinity  and  obtain 

dZ  =  8N,N,o,,'  (0^3-^^^''''^     ..Y^~ ^^^+^^  ^  V'  sin^  cosMVd^    (120) 

This  is  the  number  of  collisions  per  second  in  which  the  relative 
velocity  lies  in  the  range  dV  and  the  angle  between  the  relative  velocity 
and  line  of  centers  at  the  instant  of  contact  lies  in  the  range  dd^. 

70.  The  total  number  of  oil  collisions  between  the  two  kinds  of  mole- 
cules may  now  be  obtained  by  integrating  over  all  possible  values  of  ?? 
from  0  to  7i/2,  and  all  possible  values  of  V  from  zero  to  infinity.  We 
obtain 

Z  =  2N,N,o.,^(?^^^I^^^^^^^±^Y  (121) 

\  W1W2  / 

71.  On  the  other  hand,  we  may  be  interested  in  the  number  of  col- 
lisions in  which  the  relative  velocity  V  has  a  certain  value.  By  integrat- 
ing equation  (120)  for  all  values  of  ^  we  easily  obtain  for  the  number 
of  collisions  in  which  the  relative  velocity  lies  in  the  range  dV  ^ 

'  See  Jeans,  "Dynamical  Theory  of  Gases,"  Section  363,  Second  Edition,  Cam- 
bridge, 1916. 
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n  mim2 


m-iin2  l^' 


dZ  =  4iV,iV.o..'  {^,.jl-;^  „„_y)    .  -  srns  ST  V'dV  (122) 

and  by  integrating  this  between  all  values  of  V  from  Fq  to   co,  obtain 
for  the  number  of  collisions  in  which  the  relaiwe  velocity  is  greater  than  Vq 

Z(Vo)=Z\l-\ ; ^n^F  I  ^       mi  +  ,n,2kT  (122a) 

\  7^1  -\-  m2  Ik  1  J 

where  Z  is  the  total  number  of  collisions. 

72.  Finally  in  our  application  of  rates  of  chemical  reaction,  we  shall 
be  specially  interested  in  the  number  of  collisions  in  which  the  compo- 
nent of  relative  velocity  parallel  to  the  line  of  centers 

R  =  Vcos'&  (123) 

has  a  certain  value. 

To  treat  this  problem  we  shall  find  it  simplest  to  return  to  equa- 
tion (120)  and  first  determine  the  number  of  collisions  in  which 
R  =  V  cos  ■&  is  greater  than  some  particular  value  Rq.  For  a  given  value 
of  F  >  Ro,  this  will  evidently  be  true  when  cos  •&  lies  between  unity  and 
Ro/V.     Hence  we  may  write  for  this  number  of  collisions 

»     cosS  =  Ro/V 

ZiRo)  =  f  fsN.N^o.,^  (^       "^y    .J'e-i^^Tk-^  V^  sin^  cosMMV 

Ro     cos^  =  1 

and  performing  the  integrations  obtain  for  the  number  of  collisions  in 
which  the  component  of  relative  velocity  parallel  to  the  line  of  centers  is 
greater  than  Rq. 

Z(Ro)=Ze    mi  +  m^2kT  (123a) 

where  Z  is  again  the  total  number  of  collisions.^ 

And  if  we  are  interested  in  the  number  of  collisions  in  which  the 
component  R  lies  in  a  particular  range  dR,  we  can  obtain  by  the  differ- 
entiation of  (123a),  the  equation 

111   111  1  iihiti2        R' 

dZ  =  Z    '7^    .  r^e~^^^r+^.2kTRdR  (123b) 

Wi  -}-  W2    Ki 

73.  The  foregoing  results  will  be  found  of  great  value  in  interpret- 
ing the  rates  of  chemical  reactions. 

In  case  we  desire  to  use  the  formulae  to  calculate  the  number  of  col- 

•See  Langevin  and  Rey,  Le  Radium,  10,  142  (1913). 
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lisions  between  like  molecules,  they  can  evidently  be  modified  by  putting 
nix  =  W2  =  m,  O12  =  o,  and  replacing  A''!  A^2  by  N'^/2. 

For  use  in  numerical  calculations,  the  following  formukie  for  total 
numbers  of  collisions  will  be  found  convenient  * 

For  unlike  molecules 

-^=10  CxC._a,.^^T-j^-^^ 

For  like  molecules 

Z  .^  28.2922    —    "       I  1^ 


^  =  10— CV^-^  (124a) 

where  A  is  Avogadro's  number,  so  that  Z/A  is  the  number  of  collisions 
per  cubic  centimeter  expressed  in  mols  colliding,  d,  C2  and  C  are  concen- 
trations in  mols  per  cubic  centimeter,  O12  and  o  are  in  centimeters,  T  is 
in  degrees  centigrade  absolute,  and  ilf  1,  Mo  and  M  are  ordinary  molecular 
weights. 

*  See  Dushman,  Journ.  Amer.  Chcni.  Soc,  43,  407   (1921). 


Chapter  6 

Application  of  the  Maxwell-Boltzmann  Distribu- 
tion Law  to  the  Theory  of  Energy  Partition 
and  to  Specific  Heats 

74.  Introduction. — In  the  present  chapter,  we  shall  use  the  Maxwell- 
Boltzmann  distribution  law  to  investigate  the  distribution  of  energy 
among  the  different  degrees  of  freedom  of  a  system.  We  shall  first 
derive  a  very  general  equipartition  law,  which  shows  that  there  is  a 
general  function  of  the  coordinates  and  momenta  of  a  system  which  has 
the  average  value  kT  for  each  coordinate  or  momentum  involved,  and 
which  reduces  to  the  principle  of  the  equipartition  of  energy  for  those 
coordinates  and  momenta  which  contribute  quadratic  terms  to  the  total 
energy  of  the  system.  We  shall  then  use  the  principle  of  the  equipartition 
of  energy  to  obtain  values  for  the  specific  heat  of  gases  and  solids,  pro- 
vided the  substances  are  in  temperature  ranges  such  that  the  results  are 
not  affected  by  quantum  restrictions. 

We  shall  specially  stress  the  fact  that,  even  in  the  absence  of  quantum 
restrictions,  the  principle  of  the  equipartition  of  energy  holds  only  for 
those  coordinates  and  momenta  which  contribute  quadratic  terms  to  the 
expression  for  total  energy,  and  shall  investigate  the  actual  partition 
of  energy  in  certain  cases  of  interest,  where  equipartition  does  not  obtain. 
This  mode  of  presentation  differs  from  those  usually  adopted,  which  have 
by  implication  often  given  the  impression  that  the  principle  of  the  equi- 
partition of  energy  was  always  a  necessary  consequence  of  the  classical 
dynamics,  when  in  reality  it  is  primarily  a  consequence  of  the  assump- 
tion that  the  energy  of  a  system  is  a  homogeneous  quadratic  function 
of  the  coordinates  and  momenta. 

75.  The  General  Equipartition  Law. — Starting  with  the  Maxwell 
Boltzmann  distribution  law,  it  is  possible  to  derive  a  general  equiparti- 
tion law  by  carrying  out  a  partial  integration.  In  accordance  with  the 
Maxwell-Boltzmann  distribution  law,  we  may  evidently  write 


/■■•/ 


Ce      kf  dqi . .  .  .dpn  =  1  (125) 

where  the  integrations  are  to  be  taken  over  all  possible  values  of  the 

72 
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coordinates  and  momenta.  Suppose  now  we  perform  a  partial  integra- 
tion with  respect  to  some  one  of  the  coordinates  or  momenta  qi  . .  . .  pn, 
say  for  simpHcity  with  respect  to  q^.  We  shall  evidently  obtain  in  this 
way  the  following  equation 


j...jCe-W  g^d,..  ...dp.  ""'  +/. . .  .fee  -  .%  1^^  d,....dp.=  l       ( 126) 

qi  =  lower 

limit 

76.  In  interpreting  the  above  equation  we  usually  encounter  two  sorts 
of  cases,  either  e  is  independent  of  the  variable  ^i  in  question,  or  else 
depends  upon  e  in  such  a  way  as  to  become  infinite  when  q-i  becomes 
infinite. 

An  example  of  the  first  possibility  is  given  when  qx  is  a  positional 
coordinate  for  a  molecule  of  a  dilute  gas  in  the  absence  of  an  external 
field.  In  such  a  case  the  second  term  in  the  equation  becomes  zero  since 
de/dqi  is  zero,  and  the  expression  reduces  to  an  equality  of  the  first  term 
with  unity,  an  evident  result  under  the  circumstances. 

In  the  cases  of  interest  for  our  present  considerations  8  is  dependent 
on  qi  and  becomes  infinite  when  q^  is  infinite.  Furthermore  at  the  lower 
and  upper  limits  qi  itself  assumes  either  the  values  zero  and  infinity  or 
the  values  minus  and  plus  infinity. 

This  will  be  true  for  example  when  qi  is  a  displacement  with  which 
potential  energy  is  associated,  the  amount  of  potential  energy  increasing 
without  limit  with  the  displacement.  It  will  also  be  true  in  the  very 
usual  case  when  the  variable  in  question  is  a  momentum,  an  infinite  value 
of  kinetic  energy  being  associated  with  the  infinite  positive  or  negative 
value  assumed  by  the  momentum  at  its  upper  and  lower  limits. 

77.  Under  the  conditions  assumed,  the  first  term  of  equation  (126) 
becomes  zero,  at  both  limits  either  because  qi  itself  is  zero  at  the  limit,  or 
because  the  infinite  value  of  the  negative  exponent  of  c  outweighs  the 
infinite  value  of  q^.     The  equation  can  then  be  rewritten  in  the  form 

J.  .  .Cce  ~  Fr{  g^p-\dq,...  dpn  =  kT  ( 127) 

In  accordance,  however,  with  the  principles  of  averaging  discussed  in 
Section  54  of  Chapter  4  the  left-hand  term  of  this  equation  gives  us  the 
average  value  of  the  quantity  in  brackets,  so  that  we  obtain  the  general 
equipartition  law 
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applying  in  all  cases  where  e  depends  in  the  way  stated  on  the  coordinate 
or  momentum  involved. 

78.  The  Equipartition  of  Energy. — There  are  many  cases  in  which 
a  variable  qi  contributes  merely  a  term  of  the  form 

ei=:\aiqi^  (129) 

to  the  energy  of  the  system,  where  ai  is  a  constant.  This  will  be  true 
when  Qi  is  a  displacement  with  an  elastic  force  which  obeys  Hooke's  law 
{lit  tenseo  sic  vis),  or  when  the  variable  is  an  ordinary  linear  or  angular 
momentum.    In  such  cases  the  general  law  (128)  evidently  reduces  to 

[eiU.  =  ikT  (130) 

and  we  have  the  law  of  the  equipartition  of  energy  for  such  coordinates 
or  momenta. 

79.  The  Specific  Heat  of  Gases. — We  may  immediately  use  this 
result  to  determine  the  specific  heat  of  gases. 

Since  monatomic  molecules  have  only  the  kinetic  energy  corresponding 
to  the  three  components  of  linear  momentum,  we  may  evidently  write  in 
accordance  with  equation  (130)  for  the  energy  and  heat  capacity  of  one 
mol  of  a  monato'mic  gas 

E  =  ^NkT  =  ^RT  C^  =  ^R  (131) 

where  N  is  Avogadro's  number. 

For  diatomic  gases,  where  the  molecules  can  be  regarded  as  rigid  dumb- 
bells capable  of  angular  momentum  around  the  two  principal  axes,  we 
have  to  add  the  energy  corresponding  to  these  two  further  momenta,  and 
obtain 

E  =  ^NkT  =  ^RT  C^  =  ^R  (132) 

For  polyatomic  gases,  provided  the  molecules  could  be  regarded  as 
rigid,  we  should  have  kinetic  energy  corresponding  to  still  one  more  mode 
of  rotation  and  should  obtain 

E  =  ^NkT  =  ZRT  C^  =  3R  (133) 

80.  On  comparison  with  experiment,  it  is  found  that  the  above  con- 
clusions with  regard  to  the  specific  heat  of  monatomic  gases  are  in  excel- 
lent agreement  with  the  facts.  In  the  case  of  diatomic  and  polyatomic 
gases,  however,  the  simple  results  expressed  in  equations  (132)  and  (133) 
have  to  be  modified  in  two  ways.     At  low  temperatures  the  actual  spe- 
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cific  heat  can  fall  below  that  given  by  the  equations,  since  as  we  shall 
later  see,  quantum  restrictions  must  be  considered,  which  act  in  such  a 
way  as  to  make  it  more  difficult  for  the  molecule  to  acquire  rotational 
energy  than  we  assumed.  On  the  other  hand,  at  high  temperatures  the 
specific  heat  may  rise  above  that  given  by  equations  (132)  and  (133), 
since  the  molecules  begin  to  pick  up  vibrational  energy  and  can  no  longer 
be  regarded  as  rigid  structures.  These  matters  will  be  more  fully  dis- 
cussed after  we  have  introduced  the  further  restrictions  which  quantum 
theory  imposes  on  statistical  mechanics. 

81.  The  Specific  Heat  of  Solids. — Let  us  now  consider  one  mol 
of  a  solid,  composed  of  A^  atoms  and  hence  having  3iV  degrees  of  free- 
dom, and  let  the  atoms  be  bound  to  their  equilibrium  positions  by  elastic 
forces  which  obey  Hooke's  law.  The  total  energy  of  this  system  will 
evidently  be  a  homogeneous  quadratic  function  of  the  ZN  variables  giving 
the  positions  of  the  atoms  and  the  corresponding  3A^  linear  momenta. 
Applying  the  principle  of  the  equipartition  of  energy  as  given  by  equa- 
tion (130),  we  shall  then  evidently  obtain  for  the  energy  and  specific 
heat  of  the  solid. 

E  =  ^NkT  =  ZRT  C^  =  3R  (134) 

This  result  is  in  accordance  with  the  well-known  law  of  Dulong  and 
Petit  for  the  specific  heat  of  solid  elements. 

82.  Here  too,  however,  we  find  deviations  from  the  simple  law,  due 
to  the  interference  of  quantum  restrictions,  which  will  be  thoroughly 
discussed  in  a  later  chapter.  In  general,  at  low  temperatures  the  specific 
heat  falls  below  that  given  by  equation  (134),  owing  to  the  fact  that 
the  vibrational  energy  of  the  system  must  be  quantized ;  so  that  equa- 
tion (134)  becomes  a  limiting  law  usually  holding  fairly  well  when  we 
get  up  to  room  temperatures.  On  the  other  hand  in  the  case  of  strongly 
metallic  solid  elements,  the  specific  heat  often  rises  above  the  value  3R 
owing  presumably  to  the  participation  of  electrons  in  the  thermal  motion. 

83.  The  Partition  of  Energy  in  General. — From  the  above  dis- 
cussion in  Section  78,  it  is  evident  that  the  law  of  the  equipartition  of 
energy  is  a  special  case  holding  when  the  energy  depends  in  a  particular 
way  on  the  coordinates  and  momenta.  In  the  more  general  case  of  any 
system  of  such  a  nature  that  e  may  be  regarded  as  being  the  energy 
of  an  individual  molecule,  we  may  evidently  obtain  in  accordance  with 
principles  of  averaging  the  average  energy  of  a  molecule  from  the  equation 

Bay.  =  I  ...   \  Cc~^  kf  e  dqi  .  . .  dpn  (135) 
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which,  introducing  the  value  of  C,  given  by  equation  (174)  in  Chapter  4, 
can  evidently  be  rewritten  in  the  form 


I    ...    I   ^        kT  dqi  .  .  .  dpn 


And,  in  case  it  is  possible  to  separate  out  the  portion  Si  of  the  energy 
associated  with  a  single  variable  qi,  we  can  write 

I  e'^'kf  Ei  dqi 

l^iU=^   _^^ (137) 

I  e      kT  dqi 

These  equations  will  permit  a  general  treatment  of  the  partition  of  energy, 
provided  the  dependence  of  e  on  the  coordinates  and  momenta  qi  .  .  .  .  pn 
is  known. 

84.  In  some  cases  the  dependence  of  e  upon  the  q's  and  p's  is  such  as 
to  make  the  integrations  indicated  in  equation  (136)  very  difficult  or 
possible  only  by  approximation.  It  is  then  often  convenient  to  make  use 
of  the  general  equipartition  law  for  the  quantity  [qids/dqi]  as  given  by 
equation  (128).  By  comparing  the  value  of  this  quantity  with  the  value 
of  the  energy,  it  may  then  be  possible  to  obtain,  in  a  very  easy  way,  at 
least  an  idea  of  the  actual  partition. 

Using  one  or  the  other  of  these  methods  of  attack,  we  may  now  con- 
sider the  partition  of  energy  in  a  number  of  cases  where  equipartition 
does  not  prevail. 

85.  Energy  Proportional  to  Any  Power  of  the  Variable. — Con- 
sider for  example  a  case  where  the  energy  associated  with  the  /th  co- 
ordinate is  given  by  the  expression 

Si=aqi^  (138) 

which  could  be  the  form  in  a  hypothetical  «ow-harmonic  oscillator.  Sub- 
stituting this  expression  in  equation  (137)  and  performing  the  integra- 
tions it  can  easily  be  shown  that  we  obtain  in  general 

kT 

[eiU.  =  ~  (139) 

n 

which  reduces  to  the  familiar  yzkT  when  the  energy  is  proportional  to  the 
square  of  the  coordinate. 
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86.  Oscillators  not  Obeying  Hooke's  Law. — Actual  oscillating  sys- 
tems usually  differ  from  ideal  harmonic  oscillators  in  that  the  elastic 
restoring  force  falls  below  proportionality  to  the  displacement  as  the 
latter  gets  larger.  Hence  the  energy  does  not  continue  to  build  up 
proportionally  to  the  square  of  the  displacement  when  the  displacement 
becomes  large,  and  indeed  usually  falls  off  increasingly  as  the  displace- 
ment continues  to  increase.  This  could  be  allowed  for  by  using,  for 
the  potential  energy  ei  corresponding  to  the  displacement  qi,  the  follow- 
ing expression  which  may  be  regarded  as  an  empirical  formula, 

e^=  aq^—hqi^  —  cqi^  —  ....  (140) 

a,  b,  c,  etc.,  being  positive  quantities.  Applying  our  general  equiparti- 
tion  law  as  given  by  equation   (128),  we  obtain 

[qi{2aqi  —  Ahq^  —  6cq^  —  ...)^^^,  =  kT  (141) 

Dividing  by  two  and  combining  with   (140)  we  obtain 

[£,],,.  =  lkT-\-  [bqi'  +  2cqi'  -f  .  .  .]av.  (142) 

The  thing  of  interest  is  to  note  that  the  deviation  from  the  usual  equi- 
partition  value  is  in  the  direction  of  increased  energy  content  and  increased 
heat  capacity. 

Some  care  must  be  used  in  connection  with  the  above  treatment,  since 
our  method  of  deriving  the  general  equipartition  law  evidently  limits  the 
application  to  cases  where  £i  may  be  regarded  as  approaching  infinity 
with  qi,  and  this  assumption  might  cease  to  be  justifiable  at  tempera- 
tures so  high  that  very  large  displacements  became  frequent.  In  addition 
the  form  of  the  empirical  equation  (140)  would  not  be  suitable  for  large 
displacements. 

If  the  deviation  from  Hooke's  law  is  in  the  opposite  direction,  we 
shall  evidently  obtain  lower  energies  and  heat  capacities  than  the  usual 
equipartition  values.  This  possibility  may  be  of  importance  in  the  case 
of  large  oscillations  of  the  atoms  of  a  crystal. 

Too  much  stress,  however,  must  not  be  laid  on  these  possibilities,  until 
after  we  have  investigated  the  very  large  deviations  from  equipartition 
which  are  due  to  the  quantum  theory. 

87.  Particles  Obeying  the  Relativity  Laws  of  Motion. — Another 
interesting  case  of  deviation  from  equipartition  arises  when  we  use  the 
exact  relativity  laws  for  the  motion  of  the  molecules  of  a  gas  instead 
of  the  approximate  Newtonian  laws  which  are  true  only  at  low  velocities. 

In  relativity  mechanics  the  total  energy  of  a  particle  is  given  by  the 
expression 
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where  ni  is  the  mass  of  the  particle  at  rest,  u  is  its  velocity  and  c  the  veloc- 
ity of  light;  and  the  three  components  of  momenta  are  given  by  the 
equations 

mi-             ^                 my             ,                  mi  /■iaa\ 

V  1  —  u^/c^  V  1  —  u^/c^  VI MV^ 

Noting  that 

u-  =  i-2  _i-  y2  _j_  ^2  ( 145^ 

and  defining  what  may  be  called  the  total  momentum  by  the  equation 

P' =  Pa^' +  P/ -^  P^'  (146) 

we  can  obtain  by  substitution  in  (143)  the  following  expression  for  the 
energy  in  terms  of  the  momentum 


s  =  cV  p^-{-m^c^  (147) 

We  may  now  write  the  Maxwell-Boltzmann  distribution  law  in  the 
form 

dN  =  NCe  kT  dx  dy  dz  dpxdpydpz  (148) 

or  integrating  over  the  total  volume  of  the  vessel  V  and  changing  for  con- 
venience to  polar  coordinates  we  obtain 


c  %//>■  +  m-(r 

dN  =  NCVe  kf         p^  sm^dM<pdp  ( 149) 

and  integrating  for  all  directions 

dN  =  AnNCVe  ~  '    '^ kr  '" '   p^^dp  (150) 

Referring  to  equation  (136),  we  can  now  write  for  the  average  energy 
of  a  particle 

e H^T cVp^-i-m^c^    p^dp 

\     e  kT         P  dp 

Unfortunately  there  appear  to  be  no  general  algebraic  expressions 
for  the  above  integrals  and  we  must  adopt  some  method  of  approximation. 

88.  Energy  Partition  in  Accordance  v^^ith  Relativity  Theory  in 
the  Case  of  Very  High  Temperatures  or  Very  Small  Masses. — Since 
the  molecules  tend  to  assume  larger  and  larger  momenta  the  higher  the 


87-89]  ENERGY  PARTITION  79 

temperature,  we  may  neglect  the  constant  quantity  m^c^  in  comparison 
with  p^  in  carrying  out  our  integrations  provided  the  temperature  is  ex- 
ceedingly high.     Under  these  circumstances,  equation  (151)  reduces  to 


'kT  p^dp 


(152) 


in  terms  of  well-known  integrals.     Evaluating  we  obtain 

e,,.  =  ZkT   ■    E  =  3RT   ;    C,  =  3R  (153) 

We  thus  obtain  the  interesting  result  that  at  extremely  high  tempera- 
tures, the  heat  content  and  specific  heat  of  a  monatomic  gas  are  twice 
as  large  as  we  should  expect  on  the  basis  of  the  Newtonian  mechanics. 

It  is  also  interesting  to  note  that  the  reduction  to  the  form  given  by 
equation  (152)  would  be  true  at  all  temperatures,  provided  the  mass 
of  the  particle  at  rest  becomes  zero.  We  obtain  the  surprising  result, 
that  a  mol  of  particles  having  zero  mass  at  the  absolute  zero,  would 
acquire  the  energy  3RT  and  the  mass  ZRT/c^  at  higher  temperatures. 
Thus  a  mol  of  particles  which  approach  zero  mass  at  the  absolute  zero 
would  have  a  mass  of 

3  X  8.31  X  10^  X  300  _ 

]^Q21  /     X    iU 

grams  at  room  temperature  (300°  absolute).     This  suggests  a  fascinating 
and  very  likely  profitless  field  of  speculation. 

89.  Energy  Partition  in  Accordance  with  Relativity  Theory 
at  Any  Temperature.'— In  order  to  obtain  an  idea  of  the  energy  parti- 
tion at  any  temperature,  it  is  simplest  to  make  use  of  our  general  equi- 
partition  law  as  given  by  equation  (128).  Referring  to  equations  (146) 
and  (147)  for  momentum  and  energy,  we  evidently  obtain  from  the  gen- 
eral equipartition  law 

[^■^]^=  ^p.c{f^^m\^-y^~p,'\^^=kT  (154) 

Substituting  (144)  and  introducing  the  similar  equations  for  the  y  and  z 
components  we  obtain 
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or  by  combination 

L  V  1  U~/C-  J  av. 

We  may  compare  this  result,  which  we  have  obtained  from  the  gen- 
eral equipartition  law,  with  the  expression  which  we  must  have  on  the 
basis  of  the  relativity  theory  for  the  average  kinetic  energy  of  a  particle, 
namely  ^ 

[mc^  „"| 

If  we  expand  both  expressions  (156)  and  (157)  in  the  form  of  series 
and  combine  we  easily  obtain 

£av.  -  2^'^  -^  "^'    [S  c*  "^  8  c''  "^  128  c«  "^  •  ■  •  •  J  av. 

For  the  case  of  low  enough  temperatures,  the  velocities  of  the  molecules 
will  become  so  low  that  the  bracketed  terms  become  negligible,  and 
the  expression  reduces  to  the  familiar  3/2  kT  given  in  the  Newtonian 
mechanics.  For  the  case  of  somewhat  higher  temperatures  we  may  in- 
troduce a  first  approximation  by  substituting 


[u' 


-  =  ''{-) 

\  m  / 


which  is  the  value  it  has  in  the  Newtonian  mechanics.     We  obtain 

^  =  l^^+m{^y+---  (158) 

where  M  is  the  molecular  weight. 

It  will  be  noted  as  might  be  expected  that  the  correction  term  is  larger 
the  smaller  the  mass  of  the  particles  of  the  gas,  and  the  higher  the  tem- 
perature. If  we  go  to  a  gas  with  a  molecular  weight  as  low  as  that  of 
the  electron,  about  1/1800  that  of  monatomic  hydrogen,  we  find  at  room 
temperature  that  the  correction  term  would  only  be  6  X  lO'*^  per  cent  of 
the  first  term  and  even  in  a  star  having  temperature  of  30,000°  it  would 
still  be  only  6  X  10"*  per  cent.  Hence  for  all  ordinary  temperatures  we 
may  expect  the  partition  of  energy  to  be  that  given  by  the  Newtonian 
mechanics  for  all  particles  with  a  mass  as  great  as  that  of  the  electron. 

90.  Gas  Laws  and  Relativity  Theory.— These  deviations  from  the 
Newtonian  theory  for  particles  which  obey  the  relativity  laws  of  motion 

^  To  obtain  the  kinetic  energy,  the  "internal''  energy  of  the  particle  at  rest  mc'^ 
must  be  subtracted  from  the  total  energy. 
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suggest  an  inquiry  as  to  the  validity  of  the  gas  laws  in  relativity  me- 
chanics. It  can  easily  be  shown,  however,  that  also  in  the  relativity 
theory  the  laws  of  Boyle,  Charles  and  Avogadro  will  be  true. 

Since  the  A'-component  of  momentum  for  a  particle  of  mass  m  and 
component  of  velocity  x  is  mx/  V 1  —  u'^/c'^.,  it  is  easy  to  see  in  accordance 
with  the  methods  of  Section  50  Chapter  4  that  the  pressure  will  be  given 
by  the  equation 


A^  r  mi'^        "I 


where  A^  is  the  total  number  of  particles  in  the  volume  V.  In  accordance 
with  the  general  equipartition  law,  however,  the  quantity  in  the  brackets 
has  the  average  value  kT  as  shown  in  equation  (155),  so  that  we  obtain 
at  once  the  complete  expression  embodying  all  three  of  the  gas  laws 

PV  =  NkT 


Chapter  7 
The  Elements  of  Quantum  Theory  Mechanics 

91.  Introduction. — It  would  be  unwise  to  proceed  further  with  the 
development  of  statistical  mechanics  without  first  introducing  the  hypothe- 
ses of  the  quantum  theory,  since  too  many  of  our  results  would  need 
modification  before  they  could  be  applied  to  actual  conditions,  if  we 
based  them  solely  on  our  present  foundation.  Indeed,  as  indicated  in 
the  last  chapter,  some  of  the  results  we  have  already  obtained  need  to 
be  modified  under  certain  conditions.  However,  since  all  the  conclusions 
of  the  classical  mechanics  are  subject  to  revision  in  the  light  of  the 
quantum  theory,  we  need  feel  no  chagrin  because  of  the  necessity  of 
modifying  the  results  of  the  older  statistical  mechanics. 

Interestingly  enough,  the  first  deviations  from  the  predictions  of 
the  classical  mechanics  were  actually  encountered  in  the  very  field  of 
statistical  mechanics  itself,  in  connection  with  predictions  as  to  the  dis- 
tribution of  radiant  energy  in  an  enclosure  which  has  come  to  tempera- 
ture equilibrium  with  the  walls.  These  deviations  led  to  the  first  intro- 
duction of  a  quantum  hypothesis  by  Planck  in  the  year  1900.  Since 
that  time  a  large  number  of  further  facts  have  been  discovered  not 
only  in  the  field  of  statistical  mechanics  but  more  especially  in  con- 
nection with  the  mechanics  of  individual  atoms  or  molecules,  all  of 
which  indicate  even  more  clearly  the  necessity  of  some  such  modification 
as  proposed  by  Planck. 

These  later  facts  as  to  the  mechanics  of  single  atoms  form  of  course 
a  superior  starting  point  for  the  foundation  of  a  quantum  theory,  than 
could  possibly  be  furnished  by  facts  in  such  a  complicated  field  as  sta- 
tistical mechanics  itself.  Hence  we  shall  first  consider  the  modifying 
postulates  which  must  be  introduced  into  ordinary  mechanics  in  order 
to  account  for  the  behavior  of  individual  atoms,  and  then  extend  the 
application  of  these  postulates  to  the  field  of  statistical  mechanics  in  order 
to  treat  the  behavior  of  systems  containing  many  atoms  or  molecules. 

92.  Present  Status  of  the  Quantum  Theory. — At  the  time  the 
manuscript  for  this  chapter  was  first  prepared,  the  quantum  theory 
had  reached  a  stage  in  its  development  which,  although  evidently  not 

82 
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entirely  satisfactory,  nevertheless  had  great  definiteness  and  apparently  a 
certain  degree  of  permanency.  The  status  at  the  time  was  that  so  ex- 
cellently presented  in  the  fourth  edition  of  Sommerfeld's  "Atombau  und 
Spektrallinien"  (1924)  or  in  Van  Vleck's  report  on  "Quantum  Principles 
and  Line  Spectra"  (1926). 

At  the  present  moment,  however,  the  quantum  theory  is  in  a  very 
rapid  state  of  new  development,  owing  principally  to  the  work  of 
Heisenberg,^  Born  and  Jordan,-  and  Schrodinger  ^  in  devising  new  forms 
of  dynamics  which  are  specially  adapted  for  treating  motions  that  are 
subject  to  quantum  restrictions.  This  present  rapid  flux  is  unfortunate 
from  the  point  of  view  of  our  immediate  necessity  of  giving  an  outline 
of  quantum  theory  mechanics,  upon  which  to  base  the  further  develop- 
ment of  our  statistical  mechanical  considerations.  The  older  definite 
form  of  the  quantum  theory  is  sure  to  be  greatly  modified  in  the  future, 
and  yet  the  precise  nature  of  the  modifications,  which  will  prove  most 
satisfactory,  is  still  uncertain. 

Under  the  circumstances  it  seems  wisest  to  present  our  outline  of 
quantum  theory  mechanics  in  its  older  definite  form,  and  we  shall  offer 
no  further  apologies  for  so  doing.  It  may  be  remarked,  however,  that 
the  features  of  the  older  quantum  theory  which  are  most  certain  to  be 
modified  are  the  use  of  the  Wilson-Sommerfeld  rule  of  quantization  for 
the  theoretical  calculation  of  the  energies  of  the  quantized  states,  and  the 
use  of  the  Bohr  correspondence  principle  for  the  theoretical  calculation 
of  the  intensities  of  spectral  lines.  Nevertheless,  since  the  energy  levels 
of  the  quantized  states  and  the  intensities  of  spectral  lines  are  both  of 
them  subject  to  direct  experimental  determination,  mere  changes  in  the 
methods  of  their  theoretical  prediction  will  not  affect  the  general  stand- 
point that  we  must  assume  in  adapting  statistical  mechanics  to  take 
account  of  quantum  phenomena.  On  the  other  hand,  there  is  no  tend- 
ency in  the  newer  dynamics  which  would  indicate  the  necessity  of  abandon- 
ing either  the  Bohr  relation  between  the  energy  levels  of  an  atom  and 
the  frequency  of  the  radiation  which  they  emit  or  absorb,  or  Einstein's 
general  relations  connecting  the  probabilities  of  absorption  and  emission, 
or  the  procedure  of  assigning  an  area  in  the  phase  space  of  magnitude  h 
to  each  quantized  degree  of  freedom ;  and  these  are  the  features  of 
quantum  mechanics  which  are  essential  for  the  extension  of  statistical 
mechanics  which  will  be  presented. 

Let  us  now  turn  to  the  discussion  of  the  new  postulates  which  have 

'Heisenberg,  Zeitschr.  fiir  Phys.,  33,  879  (1925). 

"Born  and  Jordan,  Zeitschr.  fi'ir  Phys.,  34,  858   (1925);  Born,  Heisenberg  and 
Jordan.  Zeitschr.  fiir  Phys.,  35,   557    (1926). 

"Schrodinger,  Ann.  der  Phys.,  79,  361,  489,  734  (1926). 
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been  introduced  in  the  attempt  to  modify  classical  dynamics   so  as  to 
take  account  of  quantum  phenomena/ 

93.  The  Quantum  States. — The  first  modification  of  classical  views 
which  must  be  introduced  is  to  recognize  the  existence  of  definite  in- 
ternal states  of  atoms  or  molecules,  in  which  the  atom  can  persist  for 
an  appreciable  length  of  time  without  loss  of  energy.  These  are  the  so- 
called  quantum  states  of  the  atom.  Since  they  correspond  to  different 
quantities  of  internal  energy  they  give  the  atom  a  series  of  energy  levels, 
and  it  is  found  that  the  atom  can  exist,  at  least  for  an  appreciable  time, 
only  at  one  or  another  of  these  levels,  and  may  then  change  suddenly  to 
some  other  level. 

We  now  have  very  definite  experimental  evidence  for  the  existence 
of  these  energy  levels,  obtained  by  the  method  of  electronic  impact.  In 
using  this  method  we  arrange  to  bombard  the  atoms  of  a  dilute  gas 
with  electrons  coming  from  a  hot  filament  with  a  definite  velocity  and 
kinetic  energy  corresponding  to  a  fall  through  a  known  potential  drop. 

If  the  energy,  obtained  from  this  potential  drop,  is  less  than  that 
necessary  to  raise  the  atoms  from  their  normal  energy  level  to  the  next 
higher  level,  we  find  that  only  elastic  collisions  can  take  place  between 
the  atoms  and  electrons.  This  can  be  shown  for  example  by  determin- 
ing the  magnitude  of  an  opposing  potential  drop  that  the  electrons  can 
traverse  after  they  have  made  many  such  collisions.  If,  however,  we 
increase  the  initial  energy  of  the  electrons  to  a  value  slightly  greater 
than  that  between  the  first  two  levels  of  the  atom,  inelastic  impacts  im- 
mediately take  place,  in  which  the  kinetic  energy  of  the  electron  is  used 
in  raising  the  atom  to  the  higher  level.  These  inelastic  impacts  are  im- 
mediately made  evident  by  the  inability  of  electrons,  which  have  made 
such  collisions,  to  traverse  electric  fields  which  are  easily  overcome  by 
electrons  which  start  out  initially  with  less  energy.     Hence  it  seems  as  if 

(the  existence  of  definite  energy  levels  or  quantum  states  could  be  taken 
as  an  undoubted  fact. 

94.  As  to  the  different  internal  configurations  of  the  atom  or  mole- 
cule which  correspond  to  different  quantum  states,  there  can  be  a  wide 
variety.  In  the  case  of  monatomic  hydrogen,  the  simplest  of  all  atoms, 
composed  merely  of  a  single  electron  and  a  single  proton,  we  believe  that 
the  different  quantum  states  correspond  to  different  orbits  in  which  the 
electron  can  rotate  around  the  proton.  In  more  complicated  systems, 
such  as  a  molecule  of  hydrogen  chloride,  we  have  different  states  corre- 

*  In  the  final  theory,  the  classical  dynamics  may  take  its  place  as  a  special 
limiting  case  of  a  more  general  d)Tiamics.  In  the  status  of  the  theory,  which 
we  are  presenting,  the  quantum  postulates  enter  as  restrictions  on  classical  dynamics 
when  motions  involving  high  frequencies  are  involved. 
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spending  to  different  velocities  of  rotation  of  the  molecules  as  a 
whole,  and  to  different  velocities  of  oscillation  of  the  two  nuclei  along 
the  line  joining  their  centers.  Finally  as  an  extreme  we  may  regard 
the  different  isomeric  or  tautomeric  forms  of  an  organic  molecule  as 
different  quantum  states,  say  for  example  propyl  and  isopropyl  alcohol, 
or  the  enol  and  keto  forms  of  acetoacetic  ester. 

95.  The  development  of  a  theoretical  method  of  calculating  the  in- 
ternal configurations  and  motions  of  a  molecule  which  correspond  to 
different  quantum  states  has  formed  one  of  the  main  problems  of  the 
quantum  theory.  The  method  employed  in  the  status  of  the  theory 
which  we  are  presenting  consists  in  selecting  from  the  continuous  series 
of  motions  of  different  energy  content  which  accord  with  the  classical 
mechanics  a  discrete  series  of  "quantized"  motions  which  fulfill  certain 
special  requirements.  The  criterion  for  this  selection  is  known  as  the 
Wilson-Sommerfeld  rule  of  quantization. 

On  the  basis  of  the  classical  mechanics,  if  we  have  a  steady  state 
of  internal  motion,  it  is  evident  that  the  electrons  and  nuclei  composing  a 
molecule  would  be  executing  periodic  or  more  generally  conditionally 
periodic  motions.  In  such  a  conditionally  periodic  motion  there  is  a 
fundamental  period  associated  with  each  degree  of  freedom  which  is  the 
time  necessary  for  the  corresponding  coordinate  and  momentum  to  re- 
turn to  a  given  starting  place.  If  now  we  consider  the  coordinate  q 
and  momentum  p  for  any  particular  degree  of   freedom,  and  take  the 

value  of  the  phase  integi'al  I  pdq  over  a  complete  period,  the  Wilson- 
Sommerfeld  rule  of  qiuintization  states  that  this  integral  must  be  equal 
to  some  simple  multiple  of  Planck's  constant  h,  if  the  molecule  is  in  a 
steady  quantized  state  of  motion.  The  rule  can  be  stated  in  the  form  of 
the  following  equation 


/ 


pidqi  =z  mh  (159) 


The  circle  on  the  integral  sign  indicates  that  the  integration  is  to  be 
taken  over  the  complete  period  belonging  to  the  degree  of  freedom  in 
question  (i.e.  to  the  i'th  degree  of  freedom).^  And  the  quantum^  number 
Hi  gives  the  particular  multiple  of  h  that  the  phase  integral  equals. 

"  In  case  there  is  indefiniteness  as  to  the  period  over  which  the  integration  is 
to  be  taken,  we  may  encounter  the  phenomenon  of  "weak"  quantization.  Under  these 
circumstances  there  appears  to  be  a  tendency  for  the  system  to  assume  a  state 
which  corresponds  to  the  quantization  of  the  truly  periodic  motion  with  which 
the  actual  nearly  periodic  motion  is  in  approximate  agreement.  See  Ehrenfest  and 
Tolman,  Phys.  Rev.,  24,  287  (1924).  The  status  of  weak  quantization  in  the 
newer  dynamics  is  not  yet  apparent.  It  seems  possible  that  the  idea  will  no  longer 
be  valuable. 
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For  different  quantized  states  this  quantum  number  will  assume  differ- 
ent values  which  can  differ  from  each  other,  however,  only  by  integral 
amounts.  In  addition  it  is  found  experimentally  that  the  lowest  possible 
value  of  the  quantum  number  is  either  zero  or  one-half  depending  on 
the  kind  of  degree  of  freedom  involved.  Hence  for  the  different  quantized 
states  of  a  molecule,  the  quantum  numbers  for  the  various  degrees  of 
freedom  present,  all  have  values  n  =  0,  I,  2,  3  .  .  .  ov  n  =  1/2,  3/2, 
5/2  ... . 

We  thus  see  that  the  Wilson-Sommerfeld  rule  of  quantization  selects 
out  from  the  different  classically  allowed  motions,  which  would  present 

a  continuous  range  of  values  for  the  different  phase  integrals  Ii=  (p  pidqi 


a  discrete  set  of  allowed  quantized  motions  with  definite  values  of  the 
phase  integrals  and  corresponding  definite  values  of  the  energy  content  of 
the  molecule. 

96.  The  further  development  of  quantum  theory  dynamics  is  appar- 
ently going  to  be  in  the  direction  of  concentrating  attention  on  the 
quantized  states  of  motion  and  giving  up  any  idea  of  the  existence  of 
the  intermediate  classically  possible  motions.  The  development  will  prob- 
ably involve  the  abandonment  of  the  Wilson-Sommerfeld  rule  of  quantiza- 
tion. The  idea  of  the  existence  of  states  of  definite  energy  content,  how- 
ever, which  is  the  result  of  inescapable  experimental  evidence,  cannot 
be  abandoned  and  this  is  the  idea  of  importance  for  our  further  develop- 
ment of  statistical  mechanics. 

97.  Mechanism  of  Passage  of  Molecules  from  One  Quantum 
State  to  Another. — Having  established  the  existence  and  to  some  extent 
the  character  of  the  different  quantum  states,  we  are  next  concerned 
with  the  processes  by  which  a  molecule  can  pass  from  one  quantum 
state  to  another.  Experimentally  we  find  that  this  can  occur  either  by 
interaction  with  radiation  or  by  collision  with  other  molecules. 

Thus  a  molecule  can  pass  from  a  lower  to  a  higher  quantum  state 
by  the  absorption  of  radiation  and  in  the  reverse  direction  by  emission. 
Furthermore  a  molecule  can  collide  with  another  molecule  and  pass  from  a 
lower  to  a  higher  quantum  state,  by  drawing  on  the  supply  of  kinetic 
energy  which  is  available  (called  collision  of  the  first  kind)  or  by  taking 
internal  energy  from  the  other  molecule  if  the  latter  is  itself  in  an 
excited  state  (called  collision  of  the  second  kind).  On  the  other  hand, 
a  molecule  can  pass  from  a  higher  to  a  lower  quantum  state  as  a  result 
of  collision,  either  by  giving  up  its  energy  of  excitation  in  the  form  of 
kinetic  energy  or  by  transferring  at  least  a  portion  of  this  energy  to  be 
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used  in  raising  the  molecule  with  which  it  has  collided  into  a  higher 
quantum  state   (both  called  collisions  of  the  second  kind). 

The  experimental  facts  demonstrating  the  passage  of  molecules  from 
one  quantum  state  to  another  by  the  absorption  and  emission  of  radia- 
tion and  by  collisions  of  the  first  and  second  kind  are  very  satisfactory. 
Concerning  the  internal  mechanism  of  these  changes,  however,  we  still 
have  only  partial  knowledge.  There  are,  however,  two  characteristics 
of  these  processes  concerning  which  we  have  definite  information. 

98.  In  the  first  place  when  the  passage  from  one  state  to  another 
occurs  through  the  emission  or  absorption  of  radiation,  there  is  a  definite 
relation  between  the  energy  lost  or  gained  and  the  frequency  of  the 
radiation.  This  relation  is  known  as  Bohr's  frequency  postulate  and  can 
best  be  stated  in  the  form  of  the  equation 

AE  =  hv  (160) 

where  zlii  is  the  difiference  in  energy  between  the  initial  and  final  states 
of  the  molecule,  v  is  the  frequency  of  the  absorbed  or  emitted  light  and  h 
is  Planck's  constant.^ 

99.  In  addition  to  this  knowledge  as  to  the  frequencies  involved  when 
the  transfer  between  states  takes  place  through  interaction  with  radia- 
tion, we  can  assert  in  general,  that  when  molecules  pass  from  one  quantum 
state  to  another  by  any  process,  the  laws  of  the  conservation  of  energy 
and  of  linear  and  angular  momentum  must  hold.  This  conclusion  is 
necessarily  true,  at  least  statistically,  since  otherwise  these  laws  could 
not  hold,  as  we  know  they  do,  for  macroscopic  systems  composed  of 
molecules.  In  addition  the  experiments  of  Compton  and  Simon  "^  and 
of  Bothe  and  Geiger  ®  indicate  at  least  in  one  case  that  the  conservation 
laws  hold  as  well  for  individual  molecular  processes. 

The  conservation  laws  for  energy  and  momentum  have  proved  valu- 
able in  furnishing  simple  quantum  theory  explanations  of  the  Doppler 
effect  and  the   Compton   efifect,   while   the   conservation   of   angular   mo- 

'  Equation  (160)  would  indicate  that  the  light  emitted  by  a  system  of  molecules 
should  be  absolutely  unifrequentic  provided  the  temperature  is  low  enough  so  as 
to  ehminate  the  Doppler  effect  and  the  molecules  far  enough  apart  to  eliminate 
deviations  in  the  energy  of  the  molecules  due  to  their  mutual  interactions.  This 
perhaps  would  be  the  view  advocated  by  the  most  extreme  holders  of  the  so-called 
light-quantum  theory.  It  seems  possible,  however,  as  an  alternative  that  spectral 
lines  become  sharper  and  sharper,  the  less  "weak"  the  quantization  (see  preceding 
footnote),  and  are  never  absolutely  unifrequentic.  Equation  (160),  however,  is 
entirely  satisfactory  for  the  very  sharp  lines  that  we  encounter  in  the  usual 
case  of  "strong"'  quantization. 

'Compton  and  Simon,  Phys.  Rev.,  26,  289  (1925). 

•Bothe  and  Geiger,  Zeits.  fur  Phys.,  32,  639   (1925).  , 
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menttim  can  be  used  to  interpret  the  polarization  of  the  radiation  emitted 
when  an  atom  falls  from  one  quantum  state  to  another. 

100.  Probability  of  Passage  from  One  Quantum  State  to  An- 
other.— We  must  next  inquire  as  to  the  frequency  with  which  mole- 
cules pass  from  one  quantum  state  to  another,  and  since  the  internal 
mechanism  of  the  transitions  from  state  to  state  is  largely  unknown, 
we  can  merely  ask  as  to  the  probability  of  transition  under  a  given  set  of 
external  conditions. 

101.  In  the  case  of  changes  involving  the  interaction  of  radiation, 
we  have  definite  evidence  from  absorption  experiments  that  the  prob- 
ability of  transition  from  a  loiver  to  a  higher  quantum  state  is  directly 
proportional  to  the  intensity  of  the  radiation  needed  for  the  change. 
While  for  a  transition  from  a  higher  to  a  lotver  quantum  state,  the 
measurements  of  Wien  °  on  the  decay  in  the  intensity  of  the  light  coming 
from  the  excited  atoms  in  a  canal  ray,  have  shown  us  that  we  must 
ascribe  to  excited  atoms  a  definite  probability  in  unit  time  of  spontaneously 
emitting  energy  and  dropping  to  a  lower  quantum  state. 

102.  As  the  quantitative  laws  governing  the  probabilities  of  absorp- 
tion and  emission  we  shall  accept  the  views  presented  by  Einstein  ^°  in 
1917.  In  accordance  with  these  views,  the  chance  per  second  that  a 
molecule  will  pass  from  a  lower  state  i  to  a  higher  state  /  by  the  absorp- 
tion of  radiation  will  be  Bi^jU^  where  Bi_^j  is  a  proportionality  con- 
stant depending  on  the  nature  of  the  states,  and  u^  is  the  radiation  density 
at  the  frequency  v  which  is  involved  in  the  transfer.  For  the  probability 
of  a  transition  in  the  opposite  direction  from  state  /  to  state  /,  with  the 
emission  of  radiation,  we  shall  have  the  sum  of  two  terms  Aj^i  + 
Bj^iU^,  where  Aj^i  is  the  chance  per  second  of  spontaneous  emission, 
and  Bj^i  is  a  proportionality  constant  for  induced  emission  under  the 
action  of  radiation  of  the  frequency  v.  The  implication  of  these  laws 
will  be  more  fully  treated  in  Chapter  16,  where  theoretical  relations  con- 
necting the  three  proportionality  constants  will  be  obtained. 

103.  The  chance  of  spontaneous  emission  Aj^i  will  evidently  de- 
termine the  mean  life  of  an  excited  molecule  in  the  absence  of  collisions 
with  other  molecules  and  in  a  field  free  enough  from  radiation  so  that 
we  can  neglect  the  probability  of  the  induced  transition.  If  there  is 
only  a  single  lower  state  /  to  which  molecules  in  the  state  /  can  jump, 
the  mean  life  in  the  upper  state  will  evidently  be  given  by 

'W.  Wien,  Ann.  der  Physik,  73,  483   (1924)   and  earlier  work. 
"Einstein,  Phys.  Zts.,  18,  121  (1917). 
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^  =  ^—.  (161) 

and  in  case  there  are  a  number  of  lower  states  i,  we  must  obviously  write 

x  =  ^-^  162) 

i 

Actual  values  of  the  mean  life  may  vary  all  the  way  from  the  neigh- 
borhood of  10"'  seconds  for  a  mercury  atom  excited  to  the  2po  state,  to 
the  comparatively  long  life  of  an  oscillating  hydrogen  chloride  molecule 
and  the  practically  indefinite  life  of  an  organic  isomer.^^ 

104.  In  interpreting  this  life  time  of  excited  atoms  and  molecules, 
we  encounter  some  difficulty  in  distinguishing  between  the  time  during 
which  the  atom  stays  in  its  upper  quantum  state  (Verweilzeit)  and  the 
time  occupied  in  travelling  from  the  upper  to  the  lower  state  (Abklingung- 
zeit).  Indeed  it  is  possible  that  our  present  concepts  of  time  are  in- 
adequate for  the  treatment  of  interatomic  phenomena.  Nevertheless, 
we  obtain  for  the  present  a  very  satisfactory  representation  of  reality 
if  we  assume  that  the  time  taken  by  the  atom  in  travelling  from  one  state 
to  another  is  negligibly  short  compared  with  the  time  spent  in  the  differ- 
ent quantum  states. 

105.  Turning  now  to  the  probability  of  transition  from  one  state  to 
another  by  collision  with  other  molecules,  we  can  merely  say  that  the 
chance  of  transition  will  be  proportional  to  the  number  of  collisions 
occurring  and  to  proportionality  constants  whose  inner  meaning  is  still 
to  be  elucidated,  although  certain  theoretical  relations  connecting  the  prob- 
abilities of  transition  in  reverse  directions  will  be  derived  in  Chapter  17. 

On  the  experimental  side  the  work  of  Glockler  (see  Chapter  17)  in- 
dicates that  the  transition  of  an  atom  from  a  lower  to  a  higher  quantum 
state  by  electron  impact,  has  an  appreciable  probability  of  occurrence,  only 
when  the  electron  has  not  more  than  a  very  slight  excess  over  the  neces- 
sary amount  of  kinetic  energy  to  make  the  change  possible  at  all.  And 
with  regard  to  the  transfer  of  energy  of  excitation  from  one  atom  to 
another  the  work  of  Stuart  (see  Chapter  21)  has  shown  that  the  prob- 
ability for  this  to  occur  is  a  very  specific  property  of  the  atoms  involved. 

106.  Classical  Mechanics  as  a  Limiting  Case. — In  the  method  of 
presentation  chosen  for  the  purposes  of  this  chapter,  we  have  considered 
that  the  quantum  theory  mechanics  arises  from  the  classical  mechanics 

"  See  Tolman,  "Duration  of  Molecules  in  Upper  Quantum  States,"  Phys.  Rev., 
23,  693  (1924),  and  Chapter  16  of  this  book. 


90  STATISTICAL  MECHANICS  [Ch.  7 

by  the  introduction  of  extra  restrictions,  which  were  not  appreciated  until 
the  world  of  atomic  phenomena  had  been  studied.  For  the  final  method 
of  presentation  which  will  ultimately  be  satisfactory,  it  seems  possible 
that  a  more  general  dynamics  will  be  invented  of  which  the  classical 
dynamics  will  be  a  limiting  case.  Whatever  the  future  may  bring,  how- 
ever, it  is  already  evident  that  many  dynamical  phenomena — including 
the  motions  of  molar  masses  whose  study  led  to  the  classical  mechanics 
and  including  also  the  translational  motion  of  the  molecules  of  a  gas 
at  ordinary  temperatures,  which  has  been  of  main  interest  for  the  pre- 
ceding applications  of  statistical  mechanics — will  not  be  subject  to  quantum 
restrictions. 

To  show  this  let  us  return  to  the  Wilson-Sommerfeld  rule  of  quantiza- 
tion for  the  fth  degree  of  freedom  of  a  system 


/ 


pidqi  =  nih 


Referring  to  equation  (12)  in  Chapter  2  for  the  definition  of  pi,  we  can 
rewrite  this  equation  in  the  evident  form 

T  T 

\  pi -^dt  =   \  -^qidt^=  fiih  (163) 

0  0 

where  L  is  the  Lagrangian  function  for  the  system  and  t  is  the  period 
of  the  motion  for  the  particular  degree  of  freedom. 

For  an  ordinary  mechanical  system,  however,  the  Lagrangian  func- 
tion L  is  the  difference  between  the  kinetic  and  potential  energies  of  the 
system,  and  the  kinetic  energy  can  be  expressed  as  a  sum  of  quadratic 
terms  depending  on  the  velocities,  giving  us  ^^ 

L  =  T—V  =  ^imiqi'  —  V  (164) 

Hence  by  substituting  into  equation  (163)  we  can  obtain 


T 

J 


miQi^dt  —  fiih         or         [|  Wi^-i^Jav.  ==  2^  (165) 


Since  the  successive  quantized  states  for  fth  degree  of  freedom 
dififer  by  changes  of  unity  in  the  quantum  number  Wi,  we  see  that  for 
such  a  system  the  average  kinetic  energy  associated  with  this  degree  of 
freedom  will  increase  by  very  small  steps  whenever  the  period  of  the 
motion  x  becomes  long.     In  addition,  of  course,  the  kinetic  energy  could 

"  A  similar  treatment  is  also  possible  when  cross  products  of  the  velocities  appear 
in  the  expression  for  T. 
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be  regarded  as  increasing  practically  continuously  as  compared  with  that 
already  present,  whenever  we  are  in  the  region  of  large  quantum  num- 
bers. Finally  it  should  be  noted  that  for  all  ordinary  systems  small 
changes  in  potential  energy  will  be  associated  with  small  changes  in 
kinetic  energy. 

Thus,  since  the  energy  levels  will  lie  close  together,  we  see  that  the 
classical  dynamics  can  be  regarded  as  applying  to  any  degree  of  free- 
dom with  a  long  period  of  motion,  and  in  general  in  the  region  of  large 
quantum  numbers.  Hence  we  can  neglect  quantum  restrictions  in  the 
case  of  translational  motion,  and  in  the  case  of  periodic  motion,  provided 
the  period  is  long  or  the  number  of  quanta  of  energy  already  present  is 
large. 

107.  The  Correspondence  Principle. — Although  the  postulates 
already  introduced  in  the  preceding  sections  contain  all  that  we  need 
in  order  to  make  our  applications  to  statistical  mechanics,  nevertheless 
it  would  be  unfortunate  to  proceed  without  also  including  Bohr's  corre- 
spondence principle,  a  postulate  which  has  been  extraordinarily  finiitful 
in  the  development  of  the  quantum  theory.  This  postulate  demands  a 
certain  correspondence  between  the  rate  of  emission  of  radiation  by  mole- 
cules as  given  by  the  quantum  theory,  and  the  rate  of  emission  which 
would  have  been  predicted  on  the  basis  of  the  older  classical  theory. 
The  nature  of  the  postulates  can  best  be  understood  if  we  first  derive 
what  may  be  called  the  correspondence  theorem  which  connects  the  fre- 
qiiency  of  radiation  as  given  by  the  quantum  theory  with  the  frequency 
which  would  have  been  predicted  by  the  classical  theory.  We  shall  then 
be  in  a  better  position  to  understand  the  postulated  correspondence  be- 
tween the  rates  given  by  the  two  theories. 

108.  On  the  basis  of  the  classical  theory,  a  radiating  molecule  could 
be  treated  as  a  system  performing  a  conditionally  periodic  motion.  If 
such  a  system  has  s  degrees  of  freedom,  there  will  be  present,  in  the 
motion  of  the  system,  s  fundamental  frequencies  ^^  coi  .  .  .  .  cOg  which  are 
connected  with  the  energy  of  the  system  E  and  the  s  phase  integrals 
Ii  ....  Is  by  the  equation 

dE  =  coidli  +,...  +  cogdls  (166) 

the  phase  integrals  being  given  by  the  equations 


h=  j>  pidq,  , h  =  j> 


psdq,  (167) 

"  The  development  is  given  for  non-degenerate  systems  in  which  the  number 
of  degrees  of  freedom  is  the  same  as  the  number  of  different  fundamental  periods. 
The  modifications  for  degenerate  systems  will  be  obvious. 
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with  the  Hmits  of  integration  taken  around  the  complete  period  for  the 
degree  of  freedom  involved. 

In  accordance  with  the  classical  theory,  the  fundamental  frequencies 
coi  .  .  .  .  (Us  present  in  the  motion  of  the  molecule  and  the  frequencies  of 
their  combination  overtones  (tiCOi -f- •  •  •  •  +  ^s<Ws),  the  order  numbers 
T  being  integers,  would  be  the  frequencies  of  the  emitted  radiation.  So 
that  we  could  write  for  the  frequency  of  the  radiation 

''Classical    =   (^i^i  + +  Is^is)  (168) 

or  substituting  equation  (166) 


^'Classical   =  (  ^i  TT  + +  ^*-  TT  I  (169) 


dl^^  '"-^     '  dh 


109.  On  the  otber  hand  on  the  basis  of  the  quantum  theory  a  mole- 
cule can  also  be  treated  as  performing  conditionally  periodic  motions  in 
each  of  its  quantum  states,  but  instead  of  gradually  radiating  away  its 
energy,  it  suddenly  jumps  to  a  new  quantum  state,  the  change  in  energy 
AE  being  accompanied  by  radiation  of  frequency  v  in  accordance  with  the 
equation 

AE  =  hv  (170) 

and   the   change   in   quantum   state   being   such   that   the   phase   integrals 
change  by  whole  multiples  ol  li  in  accordance  with  the  equations 

Ah  =  Anji ,....,  AIs  =  Anji  ( 171) 

where  Atii  .  . .  .  Aus  are  integers  giving  the  changes  in  quantum  number 
that  occur. 

To  correlate  the  two  points  of  view,  we  may  now  rewrite  equation 
(170)  in  the  form 

AE        l/AE    .,     .  .   AE    .^\  ,,»^. 

or  substituting  equations   (171)  we  may  write 

V  Quantum  =   (  Au-^.  -jj-   + +  ^^^s^  J  (  173) 

110.  On  comparing  equations  (169)  and  (173),  we  observe  a  great 
similarity,  the  integral  order  numbers  and  differential  coefficients  occurring 
in  the  classical  theory  expression  being  replaced  by  the  integral  changes 
of  quantum  number  and  difference  coefficients  in  the  quantum  theory 
expression. 

This  result  may  be  called  the  correspondence  theorem.  It  indicates 
in  the  first  place  a  correspondence  between  any  given  harmonic  present 


^ 
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in  the  classical  motion  with  order  numbers  Ti  .  ,  .  .  !»  and  the  quantum 
transition  which  is  brought  about  by  changes  in  the  quantum  numbers  of 
amount  An^  n:  t^  , . .  .  Aug  =  is-  And  in  the  second  place  it  shows  at 
once  that  in  the  region  of  large  quantum  numbers,  where  we  have  a 
continuous  series  of  quantum  states  and  the  difference  coefficients  and 
differential  coefficients  become  identical,  we  can  expect  the  quantum 
theory  frequencies  which  are  actually  emitted  to  become  identical  with 
the  frequencies  calculated  on  the  basis  of  the  classical  theory. 

111.  This  correspondence  between  the  frequencies  calculated  on  the 
basis  of  the  two  theories  has  led  to  the  Bohr  postulate  of  a  relation  be- 
tween the  intensities  of  radiation  which  would  be  emitted  on  the  basis 
of  the  two  theories^ — a  relation  which  would  be  exact  in  the  region  of 
large  quantum  numbers  and  give  approximate  information  in  the  region 
of  small  quantum  numbers. 

112.  On  the  basis  of  the  classical  theory  it  was  possible  to  calculate 
the  intensity  of  radiation  corresponding  to  a  given  harmonic  with  the 
help  of  the  Fourier  analysis  of  the  motion  of  the  electric  charges  in  the 
molecule.  As  the  result  of  such  an  analysis  the  value  of  any  component 
of  the  electric  moment  of  the  molecule,  say  for  example  the  component 
parallel  to  the  Z-axis  could  be  expressed  by  an  equation  of  the  form 

I  =  2....  2 /?,,...,,  cos  271  [(tiCOi+..+t«co«)^  +  7,,...,.]     (174) 

where  the  quantities  ^^j...^,  are  the  amplitudes  corresponding  to  the 
different  harmonics  of  frequencies  (tiWi  +  •  •  •  •  +  ^sWs),  the  quantity 
y  is  a  phase  angle,  and  the    summations  are  to  be  taken   for  all 

values  of  the  integers  Ti  ....  is  from  minus  to  plus  infinity.  And  in  the 
classical  theory  the  rate  of  emission  of  any  frequency  (tiCUi  +  ....+ 
TsOis)  would  depend  on  the  value  of  the  corresponding  amplitudes 
■^Ti...r,    i"  accordance  with  the  equation 

/^\  =''^^,lD\,,..^X^,co,-^...  +  rs0^sy]       (175) 

\  (it    /Classical  «5^     ^,y,^ 

where  the  summation  sign  indicates  that  we  must  sum  for  the  amplitudes 
parallel  to  each  of  the  three  axes. 

113.  On  the  other  hand,  in  accordance  with  the  quantum  theory,  the 
rate  of  energy  emission  of  a  particular  frequency  per  molecule  in  the 
upper  state  would  evidently  be 

^)  =A.,....,hv  (176) 

at  /  Quantum 


( 
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where  A^^^^.^,  is  to  be  taken  as  the  value  of  Einstein's  probabiHty 
coefficient  for  the  spontaneous  transition  which  corresponds  to  the  har- 
monic under  consideration  by  resulting  from  changes  in  the  quantum 
numbers  An^  =  t^  . . , .  Ang  =  tg. 

114.  We  now  introduce  the  correspondence  principle  by  postulating 
an  approximate  relation  between  the  rates  as  given  by  equations  (175) 
and  (176)  which  can  be  written  in  the  simplified  form 

Ahv;:::,^  2   [D'o^]  (177) 

It  is  to  be  expected  that  this  relation  would  hold  exactly  in  the  region  of 
large  quantum  numbers  and  would  give  approximate  information  in  the 
region  of  small  quantum  numbers. 

The  necessarily  approximate  nature  of  these  latter  results  is  indi- 
cated by  the  form  of  the  relation  itself.  In  the  region  of  large  quantum 
numbers  where  the  quantized  states  lie  close  together  the  values  of  the 
amplitude  D  and  the  frequency  cd  which  then  equals  v  will  be  the  same 
in  the  initial  and  final  quantum  states  and  the  method  of  substituting 
into  (177)  is  obvious.  On  the  other  hand  in  the  region  of  small  quantum 
numbers,  D  and  co  will  have  different  values  in  the  initial  and  final  states 
and  it  is  not  evident  which  set  we  are  to  use,  or  whether  some  average 
of  these  quantities  taken  for  the  two  states  or  taken  over  the  intermediate 
classical  states  is  to  be  employed.  Apparently  one  of  the  greatest  advan- 
tages that  we  may  expect  from  the  newer  dynamics,  which  has  been 
mentioned  above,  will  be  the  removal  of  this  difficulty. 

115.  In  spite  of  these  uncertainties  as  to  the  correct  method  of 
applying  the  correspondence  principle,  it  has  proved  extremely  useful. 
Especially  in  the  hands  of  Kramers  ^^  it  has  given  excellent  predictions 
as  to  the  relative  intensities  of  related  spectral  lines,  while  with  its  help 
Tolman  and  Badger  ^^  have  been  able  to  account  for  the  absolute  intensities 
of  pure  rotation  lines  in  the  far  infra  red  spectrum  of  hydrogen  chloride. 

This  concludes  our  necessarily  somewhat  unsatisfactory  account  of 
quantum  theory  mechanics  and  we  are  now  ready  to  discuss  the  modifica- 
tions in  statistical  mechanics  which  must  be  introduced  in  order  that  we 
may  consider  quantum  phenomena. 

"Kramers,  Kgl.  Danske  Vidensk.  Selsk.  Shriftcr,  8,  III,  287  (1919). 
"Tolman  and   Badger,  Phys.   Rev.,  27,   383    (1926).     This   treatment   will   need 
modification  on  the  basis  of  the  new  quantum  dynamics. 


Chapter  8 

The  Incorporation  of  Quantum  Theory  into 
Statistical  Mechanics 

116.  Nature  of  the  Modification. — In  Section  104  of  the  preced- 
ing chapter,  we  introduced  the  new  idea  that  molecules  spend  at  least 
the  major  portion  of  their  time  in  quantized  states  of  internal  motion, 
and  have,  if  any,  only  a  fleeting  existence  in  a  condition  of  transition 
from  one  state  to  another.  It  can  at  once  be  seen  that  this  necessitates 
some  modification  in  the  older  statistical  mechanics  which  would  ascribe 
no  special  probability  to  states  of  motion  having  a  particular  energy 
content. 

It  is  evident  that  the  general  nature  of  the  modification,  which  must 
be  introduced  into  statistical  mechanics,  will  be  to  assign  in  some  way 
a  preponderating  /-weight  to  regions  in  the  /-space  which  correspond 
to  systems  in  which  the  molecules  have  the  definite  energy  contents 
allowed  by  the  quantum  restrictions.  This  can  be  accomplished,  more- 
over, by  going  at  once  to  the  phase  space,  belonging  to  each  kind  of 
molecule,  //-space,  and  assigning  a  larger  /<  weight  to  states  of  the  mole- 
cule, having  the  desired  energy  contents,  than  would  have  been  assigned 
on  the  basis  of  the  classical  theory, 

117.  There  are  two  ways  in  which  this  modification  might  be  intro- 
duced. In  the  first  place  leaving  the  dependence  of  energy  on  the  co- 
ordinates and  momenta  unchanged,  we  could  give  up  the  conclusion, 
reached  with  the  help  of   Liouville's  theorem,  that  different  regions  in 

the  phase  space  have  a  /z-weight  equal  to  the  volume   I  ....  I  dqi ....  dp„ 

involved,  and  assume  rather  that  the  /z-weight  has  been  gathered  in  from 
intervening  regions  of  the  phase  space  and  concentrated  on  those  sur- 
faces of  the  phase  space  which  correspond  to  quantized  motions.  As  a 
second  possibility  we  might  keep  the  distribution  of  //-weight  uniform 
throughout  the  phase  space,  and  change  the  dependence  of  energy  on 
the  coordinates  and  momenta  in  such  a  way  that  a  considerable  volume 
in  the  phase  space  would  all  belong  to  states  of  the  molecule  with  prac- 
tically the  same  energy  content.    The  two  possibilities  can  be  made  clearer 
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with  the  help  of  diagrams  in  which  we  plot  energy  E  against  phase  integral 
I  z=z  (j)  pdq. 

118.  Method  of  Concentrating  //-Weight. — The  first  method  in 
which  we  concentrate  ,a-weight  on  particular  boundaries  in  the  phase 
space,  leaving  the  dependence  of  energy  on  the  coordinates  and  momenta 
unchanged,  is  illustrated  by  Figure  1. 

The  system  considered  is  a  non-harmonic  oscillator  of  one  degree 
of  freedom,  and  the  dotted  curve  gives  the  relation  between  energy  E 
and  phase  integral  /,  which  would  accord  with  the  classical  theory. 


ab 


■)K 


5h 

Figure  1. 
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The  particular  oscillator  illustrated  by  the  figure  is  one  in  which  there 
is  a  deviation  from  Hooke's  law,  such  that  the  elastic  restoring  forces 
get  weaker  as  the  amplitude  increases.  This  evidently  results  in  decreas- 
ing values  of  the  frequency  of  oscillation  co  as  the  energy  increases. 
Hence,  since  energy  and  phase  integral  are  connected  with  frequency, 
on  the  basis  of  the  classical  theory,  by  the  equation 


dE^ 
dl 


(178) 


it  is  evident  that  the  energy  will  increase  less  and  less  rapidly  with  the 
phase  integral,  as  shown  by  the  curve. 

On  the  basis  of  the  classical  theory  the  oscillator  could  have  the  energy 
and  phase  integral  corresponding  to  any  point  on  the  dotted  curve.  In- 
deed the  a-priori  probability  that  the  oscillator  would  have  a  value  of 
its  phase  integral  between  /  and  I  -{-  dl  would  be  directly  proportional 
to  the  area  dl  =  dqdp.  Furthermore,  in  accordance  with  the  classical 
theory,  the  oscillator,  assuming  the  moving  part  to  be  charged,  would 
be  emitting  radiation  no  matter  where  the  representative  point  might 
be  located  on  the  curve,  the  frequencies  of  the  radiation  being  the  funda- 
mental frequency  o)  and  the  harmonics  xoi. 
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To  agree  with  the  quantum  theory,  we  might  modify  the  ahove  pic- 
ture as  follows.  In  the  first  place,  since  the  oscillators  are  in  general 
only  to  be  found  with  values  of  the  phase  integral  /  equal  to  some 
multiple  of  Ji,  we  could  gather  together  the  a-priori  probability  previously 
spread  uniformly  along  the  /-axis  and  concentrate  it  at  the  points  /  =  nJi. 
This  has  been  indicated  in  the  figure  by  inserting  heavy  dots  to  corre- 
spond to  the  allowed  values  of  phase  integral  and  energy.  In  the  second 
place,  in  spite  of  the  vibrations  which  the  oscillators  are  performing, 
we  should  have  to  assume  that  for  some  unknown  reason  they  can  remain 
for  an  appreciable  length  of  time  in  the  quantized  states  without  emitting 
radiation,  and  then  suddenly  emit  radiation  with  the  quantum  theory 
frequency 


Vln 


AE         .    AE 

——  ==  An-r-r- 

h  AI 


instead  of  the  classical   frequency 


y^  =z  TCO  ^  T 


dE 
dl 


(179) 


(180) 


119.  Method  o£  Changing  the  Relation  Connecting  Energy  with 
the  Coordinates  and  Momenta. — The  other  method  of  assigning  greater 
/<- weight  to  the  quantized  states  would  be  to  leaye  the  ^-weight  uniformly 
distributed  over  the  phase  space,  but  change  the  form  of  the  functional 
relation  connecting  the  energy  of  the  molecule  with  its  coordinates  and 
momenta.    The  method  is  illustrated  by  Figure  2. 


// 
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Figure  2. 

The  dotted  curve  again  gives  the  relation  between  energy  E  and  phase 
integral  /,  which  on  the  basis  of  the  classical  theory  would  belong  to  a  non- 
harmonic  oscillator.  The  heavy  line,  however,  giving  a  step-like  rise  of 
the  energy,  represents  the  changed  relation  between  energy  and  phase 
integral  which  is  proposed.  It  is  evident  that  the  molecule  in  question 
has    a-priori    probability   of    existence   only   at   the    energy   levels    cor  re- 
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spending  to  the  horizontal  portions  of  the  Hne  and  these  energy  levels 
are  chosen  so  as  to  agree  with  the  accepted  rules  of  quantization. 

It  may  furthermore  be  noted,  in  accordance  with  the  relation 
CO  =  dE/dl,  that  the  oscillator  would  have  zero  frequency  in  the  quantized 
states  and  hence  no  longer  vibrate,  in  agreement  with  the  fact  that  it 
can  remain  in  a  quantized  state  without  emission  of  radiation.  This 
would  also  have  the  advantage,  if  it  is  such,  of  giving  us  the  static  atom 
so  long  desired  by  chemists. 

To  round  out  the  picture  we  should  also  have  to  provide  some  way 
by  which  changes  from  one  quantum  state  to  another  could  occur  from 
time  to  time  in  accordance  with  the  laws  of  chance  through  the  absorp- 
tion or  emission  of  radiation.  This  could  perhaps  be  arranged  by  con- 
necting the  energy  levels  by  paths  which  would  give  the  molecule  the 
desired  behavior  when  it  is  interacting  with  radiation,  and  assuming  that 
the  molecule  sometimes  slips  into  one  of  these  connecting  paths.  Two 
of  these  paths  ab  and  be  have  been  indicated  in  the  figure  by  dot  and 
dash  lines,  and  remarkably  enough  all  the  necessary  paths  can  be  cor- 
rectly obtained  by  drawing  straight  lines  connecting  all  the  points  on  the 
classical  curve  which  represent  possible  values  of  E  and  /. 

The  path  ab  would  represent  the  motion  of  the  oscillator  when  it  is 
absorbing  or  emitting  radiation  in  between  the  first  two  energy  levels. 
It  should  be  specially  noticed  that  the  frequency  of  the  oscillation  on  this 
path  as  given  by  the  classical  laws  of  motion  would  be  the  actual  fre- 
quency of  the  radiation  known  to  be  absorbed  or  emitted  in  the  transition 
between  these  two  levels.    This  is  evident  since  we  have  from  the  figure 

_  dE  _  AE  _  ,.Q.. 

W  Classical  —  ~71^  —  ~7~  —  ''Quantum  l.-'^Oi; 

The  path  be  relates  to  transitions  between  the  levels  /  =  h  and  /  =  Ah, 
corresponding  to  a  change  in  quantum  number  of  3.  The  fundamental 
frequency  of  the  oscillation  on  this  path  would  also  be  correctly  given 
by  the  slope  of  the  line,  and  the  frequency  of  the  radiation  actually 
emitted  or  absorbed  is  known  to  be  the  third  harmonic  of  this  funda- 
mental frequency,  corresponding  to  the  change  of  3  in  the  quantum 
number. 

The  length  of  time  which  a  molecule  spends  in  the  intermediate  paths 
would  be  determined  by  the  damping  factor  for  the  oscillation,  and  the 
time  which  it  spends  in  the  quantized  states  would  depend  on  the  chance 
of  "slipping"  into  one  of  the  connecting  paths.  These  factors  would 
have  to  be  adjusted  so  as  to  agree  with  experiment. 

120.  Comparison  of  the  Two  Methods. — It  will  be  seen  that  the 
two  methods  of  assigning  increased  a-priori  probability  to  the  quantized 
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states  have  much  in  common.  The  first  method  is  rather  less  complicated 
and  provides  the  language  usually  adopted.^  The  second  method  tries  to 
retain  more  of  the  classical  mechanics  and  to  go  more  deeply  into  an 
explanation  of  the  frequencies  actually  absorbed  and  emitted.  The  postu- 
lated connecting  paths  have  some  of  the  properties  and  advantages  of  the 
virtual  oscillators  introduced  by  Bohr,  Kramers  and  Slater.^  The  method 
was  first  considered  by  the  author  a  number  of  years  ago  and  a  model 
constructed  illustrating  these  connecting  paths  between  different  energy 
levels  for  the  case  of  a  hydrogen  atom.^ 

Of  course  neither  method  is  entirely  satisfactory,  and  we  shall  have 
to  wait  for  a  relatively  complete  solution  of  the  problem  of  quantum 
mechanics  before  we  can  obtain  a  really  acceptable  picture  of  the  state  of 
affairs. 

121,  Result  for  Statistical  Mechanics. — For  our  present  purposes, 
it  is  only  of  interest  to  note  that  the  result  of  both  the  above  methods  or  of 
any  others  which  are  otherwise  acceptable,  will  be  the  same  as  far  as 
statistical  mechanics  is  concerned.  They  agree  in  dividing  up,  between 
the  different  quantized  states,  the  a-priori  probability  belonging  to  the 
whole  phase  space,  and  assigning  a  //-weight  of  magnitude  h  to  each 
quantized  degree  of  freedom.  This  result  can  be  incorporated  in  sta- 
tistical mechanics  by  the  introduction  of  the  following  postulate,  which 
is  expressed  in  a  general  enough  form  so  as  to  include  both  "classical" 
and  "quantized"  degrees  of  freedom.  (See  Section  106  in  the  preceding 
chapter.) 

Consider  a  molecule  hamng  n  degrees  of  freedom  with  respect  to 
which  the  motion  may  be  considered  as  classical,  and  s  degrees  of  free- 
dom for  which  the  motion  is  quantised.  The  jx-weight  for  any  given 
state  of  this  molecide  in  which  the  classical  coordinates  and  momenta  fall 
in  the  range  dcji  ....  dpn  and  the  quantised  coordinates  and  momenta  have 
the  values  corresponding  to  any  given  quantum  state  ivill  then  be 

lu  =  h'dq^  . .  .  .  dPn  (182) 

This  postulate  is  evidently  directly  required  by  the  second  method 
of  assigning  a-priori  probabilities  discussed  above,  and  is  in  entire  agree- 
ment with  the  first  method.  Certain  special  difficulties  may  arise  as  to 
the  assignment  of  /i-weight  to  the  lowest  possible  quantum  state,  but 
these  need  not  detain  us  now.  The  postulate  also  obeys  certain  restric- 
tions imposed  by  the  second  law  of  thermodynamics  which  have  been  dis- 

*See  for  example  Ehrenfest  and  Tolman,  Phvs.  Rev.,  24,  287   (1924) 

*Bohr,  Kramers  and  Slater,  Phil.  Mag..  47,  785  (1924). 

'See  Tolman,  Phys.  Rev.,  11,  261   (1918)  ;  /.  Opt.  Soc.  Anier.,  4,  211   (1922). 
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cussed  by  Ehrenfest  *  and  is  the  postulate  which  has  always  been  at  least 
implicitly  chosen.^ 

122.  Resulting  Modification  in  the  Maxwell-Boltzmann  Dis- 
tribution Law. — It  is  evident  at  once  that  this  new  postulate  will  lead 
to  modifications  in  the  Maxwell-Boltzmann  distribution  law  in  the  case 
of  degrees  of  freedom  for  which  the  motion  is  quantized.  The  nature 
of  these  modifications  is  evident,  provided  we  accept  the  postulate. 

For  a  molecule  of  n  degrees  of  freedom,  all  of  which  are  "classical," 
the  Maxwell-Boltzmann  distribution  law  can  be  written  in  accordance 
with  equation  (72)  in  Chapter  4  in  the  form 

£ 

_         Ne~  kfdqi dpn 


C...('e'~"Pf  dq^ dpn  (183) 

where  dN  is  the  number  of  molecules  having  coordinates  and  momenta 
falling  in  the  range  dqi  ....  dpn  and  the  integrations  in  the  denominator 
are  to  be  taken  over  the  whole  phase  space. 

For  a  molecule  of  s  degrees  of  freedom,  all  of  which  are  quantized, 
in  analogy  with  equation  (183)  we  can  then  write  in  accordance  with 
our  postulate,  for  the  number  of  molecules  Ni  falling  in  the  i'th  quantum 
state,  the  expression 

N,  =  ^lfZpL  =  ^  (184) 

}'  j 

where  the  summation  in  the  denominator  is  taken  over  all  possible  quantum 
states. 

Evidently  for  a  molecule  having  n  classical  degrees  of  freedom  and  j- 
quantized  degrees  of  freedom  we  could  use  an  expression  of  the  form 

dm  = Ne-TTe~Jfdq,....dpn  ^^g^^ 


2£?      kT  { i  e      kT  dqi dpn 


provided  the  energy  increase  could  be  divided  into  a  part  St  characteristic 
of  the  quantum  state  and  a  part  e  dependent  on  the  q's  and  p's. 

Cases    often   arise,    in   connection   with    degenerate    periodic   motions, 

*See  Ehrenfest  and  Trkal,  Proc.  Amst.  Acad.,  23,  179  (1920). 

"  See  Bohr,  "On  the  Quantum  Theory  of  Line  Spectra,"  Copenhagen  Academy, 
1918,  Part  I,  p.  25.  For  degenerate  states,  we  must  multiply  by  the  number  of 
individual  non-degenerate  states  considered  together  in  the  single  degenerate  state. 
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ill  which  several  quantized  states  have  the  same  energy  content.  When 
this  happens  it  is  often  convenient  to  write  equation  (184)  in  the  form 

^^^Np^T^  (186) 

^pje~'kf 
i 

where  the  quantities  pi,  etc.,  are  the  number  of  quantum  states  with 
the  same  energy  content  which  have  been  lumped  together.  The  quantities 
pi  are  of  course  integers  and  are  themselves  called  the  a-priori  prob- 
abilities for  the  collection  of  degenerate  states  which  are  considered  as 
combined  together  into  a  single  state. ° 

°  See  preceding  footnote. 


Chapter  9 
The  Specific  Heat  of  Diatomic  Gases 

123.  Rotational  and  Vibrational  Specific  Heat. — The  first  applica- 
tion which  we  shall  make  of  our  new  form  of  the  Maxwell-Boltzmann 
distribution  law,  modified  so  as  to  take  account  of  the  quantum  theory, 
will  be  to  investigate  the  specific  heat  of  diatomic  gases.  For  this  pur- 
pose it  will  be  a  close  approximation  if  we  consider  a  diatomic  molecule 
as  a  rigid  dumb-bell  like  structure  which  can  rotate  about  its  center  of 
gravity,  and  calculate  the  contribution  to  the  specific  heat  made  by  the 
possibility  thus  provided  of  acquiring  rotational  energy.  In  the  first 
analysis  it  will  not  be  necessary  to  consider  the  vibrational  energy  of  the 
atoms  along  the  line  joining  their  centers  of  gravity,  since  the  con- 
straints holding  the  atoms  together  prove  to  be  so  strong  that  appreciable 
vibrational  energy  usually  does  not  enter  until  relatively  high  tempera- 
tures. It  is  also  unnecessary  to  allow  for  changes  in  the  distance  apart 
of  the  two  atoms  due  to  centrifugal  force,  since  the  effects  turn  out  to 
be  inappreciable. 

124.  Quantization  of  a  Simple  Rotator. — The  different  quantized 
states  of  rotation  for  a  dumb-bell  like  molecule  are  the  same  as  for  any 
simple  rotator  and  can  easily  be  determined  from  the  Wilson-Sommerfeld 
rule  of  quantization  as  given  by  equation  (159)  in  Chapter  7. 


/ 


pdq  =  nh  (187) 

For  describing  the  motion  of  any  simple  rotator  we  need  only  a  single 
coordinate 

q  =  <f>  (188) 

which  gives  the  angle  through  which  the  rotator  has  turned  with  respect 
to  a  fixed  direction.  The  momentum  p^  corresponding  to  the  coordinate 
0  can  be  obtained  in  accordance  with  equation  (12)  in  Chapter  2,  by 
differentiating  the  Lagrangian  function  (difference  of  kinetic  and  potential 
energies)  by  0  giving  us 

where  /  is  the  moment  of  inertia  of  the  rotator. 
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Substituting  equations  (188)  and  (189)  in  (187),  we  obtain 

27r 

a>  pdq  =  <i  p^d(t>  =  I  J^dcj>  =  nih  ( 190) 

0 

where  in  accordance  with  convention  we  have  used  m  to  indicate 
rotational  quantum  numbers.  Since  the  angular  velocity  <^  is  constant  in 
a  steady  state  of  motion,  the  integration  is  easily  performed  and  we  obtain 
the  well-known  equation 

P^  =  U  =  j^  (191) 

for  the  dififerent  values  of  the  angular  momentum  of  a  rotator  which  are 
allowed  by  the  quantum  theory. 

125.  Application  to  a  Diatomic  Molecule. — For  a  rigid  diatomic 
molecule,  the  energy  of  rotation  will  all  be  kinetic  as  given  by  the  equation 

E  =  iJ<t>'  (192) 

which  on  substitution  of   (191)   gives 

£  =  |^  (193) 

as  the  possible  values  of  the  rotational  energy. 

In  applying  equation  (193),  it  will  be  noted  that  for  a  rigid  diatomic 
molecule  the  value  of  the  moment  of  inertia  /  will  be  a  constant  quantity 
independent  of  the  velocity  of  rotation  and  given  by  the  equation 

J  =  nnn^  +  m.r.J  (194) 

where  nh  and  uu  are  the  masses  of  the  nuclei  of  the  two  atoms  and 
Ti  and  To  their  distances  from  the  center  of  gravity.  Introducing  the  con- 
dition which  locates  the  center  of  gravity 

niiri  =  iitorz  (195) 

and  using  the  symbol 

_nHm^  (196) 

nil  -f-  nio 

for  the  mutual  mass  of  the  system,  and 

r  =  ri  +  ro  (197) 
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for  the  total  distance  between  the  nuclei,  equation  (194)  can  then  be 
rewritten  when  desired  in  the  form 

/  =  ■ . (ri  -f  ;-o )  -  =  i.ir-  (198) 

mi  +  iUo 

126.  Expression  for  Rotational  Specific  Heat. — Combining  equa- 
tion (193),  for  the  different  possible  values  of  the  rotational  energy 
of  our  molecule,  with  the  Maxwell-Boltzmann  law,  as  given  by  equation 
( 186)  in  the  last  chapter,  for  the  number  of  molecules  in  the  different 
quantum  states,  we  evidently  obtain  for  the  rotational  energy  of  the  N 
diatomic  molecules  in  one  mol  of  a  gas 

^,„       _^    '""''"'    m~lr 

£r,..= „,.,:;''  (199) 

"SipmC        8ir-J  kT 

where  the  summations  are  taken  for  all  values  of  in.  By  differentiating 
this  expression  with  respect  to  temperature,  we  obtain  what  may  be  called 
the  rotational  specific  heat 

^         dE  Rot 

C    Rot.    -  ^  . 

and  the  resulting  expression  can  be  written  in  the  very  simple  form 

%L  =  o^^!M0  (200) 

provided  we  make  the  substitutions 

(201) 


^Tt'JkT 

and  ^ 

Q  =  ^p,,c -"'"■-  (202) 

m 

In  order  to  determine  the  specific  heat  at  any  temperature  with  the 
help  of   equations    (200),    (201)    and    (202),   we   must   substitute  values 

^  The  general  quantity 

i 

which  often  occurs  in  statistical  mechanical  calculations  has  been  given  the  name 
"condition  sum"  (Zustandsumme).    In  the  classical  mechanics  it  would  have  the  form 

e 

I  e  dqi dpn 

and  is  thus  the  reciprocal  of  the  Maxwell-Boltzmann  C 
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for  the  quantum  numbers  in  and  the  a-priori  probabiHties  p,,,,  into  the  ex- 
pression for  the  "condition  sum"  Q.  The  correct  way  of  doing  this  is  a 
little  more  uncertain  than  might  at  first  thought  have  been  supposed,— in 
the  first  place  because  fractional  quantum  numbers  have  to  be  used  in 
connection  vi^ith  the  rotation  of  molecules,  and  in  the  second  place  because 
the  periodic  motion  involved  is  a  degenerate  one  and  it  is  not  certain  exactly 
how  many  completely  quantized  states  there  are  corresponding  to  each 
energy  level. 

127.  Assignment  of  Quantum  Numbers. — A  determination  of  the 
values  which  can  be  assumed  by  the  rotational  c^uantum  number  in  can 
be  obtained  by  comparing  ecjuation  (193)  for  the  dififerent  possible  values 
of  rotational  energy  with  the  results  of  spectrum  analysis.  Spectral  lines 
corresponding  solely  to  changes  in  the  rotational  energy  of  a  mdlecule 
occur  in  the  far  infra  red  and  have  now  been  studied  in  the  vei'y  simple 
case  of  rotating  hydrogen  chloride  molecules.-  The  analysis  of  the  series 
of  lines  definitely  shows  that,  although  the  successive  values  of  the  rota- 
tional energy  of  a  hydrogen  chloride  molecule  can  be  obtained  from  ecjua- 
tioh  (193)  as  would  be  expected  by  substituting  successive  values  of  in 
which  increase  by  steps  of  unity,  nevertheless  the  lowest  value  of  in  is 
not  itself  an  integer  but  has  the  value  one-half. 

The  full  meaning  of  the  occurrence  of  such  half  quantum  numbers 
is  not  yet  entirely  evident.  It  seems  possible,  however,  on  the  basis 
of  the  newer  dynamics  mentioned  in  Chapter  7,  that  this  is  fundamental. 
In  any  case  the  experimental  facts  indicate  that  in  the  evaluation  of  rota- 
tional specific  heats  we  must  substitute  for  the  successive  values  of  the 
rotational  quantum  number 

'''=2'  TV  2  ' ^^^^^ 

128.  Assignment  of  A-priori  Probabilities. — In  assigning  values 
to  the  a-priori  probabilities  in  the  expression  (202)  for  the  condition 
sum  Q,  we  must  take  account  of  the  fact  that  our  system  is  a  degenerate 
one,  since  there  is  only  a  single  fundamental  period  corresponding  to  the 
two  rotational  degrees  of  freedom.  As  a  result  of  this  degeneration  we 
have  obtained  only  a  single  independent  quantum  condition  as  expressed 
for  exa«nple  by  the  equation 

P,  =  '-!^  (204) 

for  the  possible  values  of  the  total  angular  momentum,  in  spite  of  the 
fact  that  the  system  has  two  degrees  of  freedom  which  are  performing 

'Czerny,  Zeitschr.  fiir  Phys.,  34,  227  (1925). 
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periodic  motions.  If  the  system  had  not  been  degenerate,  we  should 
have  obtained  two  independent  quantum  conditions.  On  account  of  these 
two  conditions,  we  should  have  then  had  to  introduce  another  quantum 
number  besides  m,  in  quantizing  the  motion,  and  should  have  found  that  a 
given  value  of  m  could  be  associated  with  more  than  one  value  of  the 
other  quantum  number.  Hence  from  this  point  of  view  we  may  regard 
equation  (204)  as  determining  a  group  of  quantum  states  rather  than  a 
single  quantum  state,  and  in  accordance  with  the  ideas  of  Bohr  as  given 
in  our  postulate  in  Chapter  8,  we  should  assign  to  the  whole  group  the 
weight  pmh^  where  pm  is  the  number  of  individual  states  belonging  to 
the  group. 


rn  =  //2 


rr\  -3/2 


■m^S/2 


Figure  3. 


To  determine  the  number  of  quantum  states  all  corresponding  to  the 
same  value  of  the  quantum  number  m,  we  may  think  of  our  rotating  mole- 
cule as  immersed  in  a  weak  magnetic  field  which  will  superpose  on  the 
motion  a  Larmor  procession  around  the  direction  of  the  field.  This  will 
then  introduce  a  new  period  in  the  motion,  the  period  of  the  precession, 
and  we  shall  then  be  able  to  obtain  two  quantum  restrictions  correspond- 
ing to  the  two  periods  present  in  the,  motion.  We  obtain  in  this  way 
the  number  of  separate  quantum  states  which  correspond,  in  this  weak 
magnetic  field,  to  a  given  value  of  m,  and  at  the  limit  of  zero  magnetic 
field  shall  still  assume  the  same  effective  number  of  quantum  states. 

As  a  result  of  such  investigations,  it  has  been  found  that  the  two 
quantum  conditions  prescribe  not  only  that  the  total  value  of  the  angular 
momentum  must  increase  by  multiples  of  h/2n,  but  that  the  component 
parallel  to  the  external  field  must  also  increase  by  multiples  of  h/27i.  This 
result  makes  it  at  once  possible  to  determine  the  number  of  quantum 
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states  corresponding  to  a  given  value  of  m  as  shown  by  the  foregoing 
figures  in  which  the  vectors  represent  the  possible  values  of  the  angular 
momentum.     (The  direction  of  the  external  field  is  taken  as  vertical.) 
On  the  basis  of   these  results   we  may  evidently  put  ^   for  the   a-priori 
probabilities  of  the  degenerate  states 

p^  =  2m-\-l=2,  4,  6,  8, (205) 


as 
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Figure  4. 

129.  Application  to  the  Specific  Heat  of  Hydrogen. — The  fore- 
going considerations  have  been  applied  to  an  interpretation  of  the  specific 

'  The  above  result  has  been  obtained  by  taking  the  numbers  1/2,  3/2,  5/2  etc., 
not  only  as  the  possible  values  of  the  rotational  (azimuthal)  quantum  number  m 
which  determines  the  total  angular  momentum,  but  also  as  the  values  for  the 
equatorial  quantum  number  r  which  determines  the  quantized  component  of  angular 
momentum  parallel  to  the  field.  The  former  choice  is  necessitated,  as  we  have 
remarked,  by  Czerny's  results  on  the  far  infra  red  spectra  of  hydrogen  chloride. 
The  second  choice  is  more  arbitrary.  If  we  should  use  the  values  0,  1,  2,  3,  etc., 
for  the  equatorial  quantum  number  r  we  should  obtain  the  a-priori  probabilities 
1,  3,  5,  etc.,  for  the  successive  values  of  m.  This  choice  is  favored  by  Fowler, 
Phil.  Mag.,  49,  1272  (1925);  Dieke,  Zcitschr.  fur  Phys.,  33,  161  (1925),  Nature. 
115,  875  (1925);  Kemble,  Zeitschr.  fiir  Phys.,  35,  286  (1925),  and  Bourgin  and 
Kemble,  Phys.  Rev.,  27,  802  (1926),  since  it  agrees  with  the  summation  rule  of 
Dorgelo  and  Burger.  On  the  other  hand  the  work  of  Pauling,  Phys.  Rev.,  27,  586 
(1926),  indicates  that  such  an  assignment  of  quantum  numbers  would  lead  to  an 
impossible  negative  dielectric  constant  for  hydrogen  chloride  gas. 

The  correct  decision  between  the  two  possibilities  is,  however,  quite  possibly 
pm  =  1,  3,  5,  etc.,  since  Pauling's  difficulty  disappears  in  the  new  quantum  dynamics. 
See  Mersing  and  Pauli,  Phys.  Zcits.,  27,  509  (1926),  Kronig,  Proc.  Nat.  Acad., 
12,  488  (1926),  and  Van  Vleck,  Nature,  118,  226  (1926).  Nevertheless  the  method 
of  substituting  the  a-priori  probabilities  into  equation  (202)  would  be  the  same  with 
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heat  of  hydrogen  by  substituting  equations  (203)  and  (205)  into  (202), 
and  performing  the  indicated  calculations.  Assuming  a  value  of  the 
moment  of  inertia  /  which  would  give  a  good  fit,  the  full  line  curve  shown 
in  Figure  4  was  obtained.'*  The  value  of  /  from  the  curve  was  found  to  be 
1.387  X  10'*^  c.g.s.  units.  The  value  is  of  the  right  order  of  magnitude 
from  other  information  as  to  the  hydrogen  molecule. 

The  failure  of  the  curves  at  high  temperatures  is  presumably  due  to 
the  over  simplification  of  our  hypothetical  model  of  the  molecule.  Cal- 
culations of  Kemble  and  Van  Vleck  ^  indicate  that  at  high  temperatures 
the  potential  energy  due  to  extension  of  the  molecule  by  centrifugal  force 
and  due  to  the  presence  of  vibrational  energy  must  both  be  taken  into 
account. 

130.  Application  to  the  Entropy  of  Diatomic  Gases. — Hydrogen 
is  the  only  diatomic  gas  of  low  enough  boiling  point  and  low  enough 
moment  of  inertia  so  that  we  can  make  any  direct  application  of  the 
quantum  theory  of  rotational  specific  heat.  The  theory,  however,  finds 
an  important  application  in  calculating  the  entropy  of  diatomic  gases  at 
temperatures  where  the  rotational  specific  heat  has  already  reached  the 
value  R. 

This  application  is  based  on  the  assumption  that  the  entropy  of  any  gas 
would  be  that  given  by  the  Sackur  formula  for  the  entropy  of  monatomic 
gases,  prozndcd  we  go  down  to  temperatures  low  enough  so  that  the 
specific  heat  has  fallen  to  the  value  for  monatomic  gases.''  If  this  assump- 
tion is  correct,  and  it  seems  to  be  a  necessary  consequence  of  considera- 
tions which  we  shall  later  introduce,  we  can  evidently  calculate  the  entropy 
of  any  perfect  gas  by  adding  to  the  Sackur  expression  an  additional  term 
obtained  by  a  suitable  integration  from  the  absolute  zero  to  the  temperature 
of  interest.    We  obtain  for  the  entropy  of  one  mol  of  any  perfect  gas 

S=[\R^o,T-R^o^P  +  R^o,{^2^^)+lR■\+](^C,-\R)§     (206) 

0 

where  the  first  term  is  the    Sackur  expression,"   in  being  the  mass  of  a 

either  choice,  and  experimental  data  are  not  yet  sufficient  to  permit  a  decision  on 
the  basis  of  the  difference  in  the  predicted  values  of  the  specific  heat. 

The  finally  acceptable  treatment  of  the  quantum  states  of  the  dipole  rotator 
and  their  a-priori  probabilities  will  perhaps  have  to  be  based  on  the  new  dynamics. 

*_Tolman,  Phys.  Rev.,  22,  470  (1923).  The  above  treatment  would  have  to  be 
modified  on  the  basis  of  the  new  quantum  mechanics,  but  the  correct  modification 
is  still  uncertain.     See  Van  Vleck,  Phys.  Rev.,  28,  480  (1926). 

"Kemble  and  Van  Vleck,  Phys.  Rev.,  21,  653  (1923). 

*See  Tolman,  J.  Amcr.  Cheni.  Soc.,  42,  1185   (1920). 

''  For  small  modifications  which  must  be  made  in  this  formula  where  the  molecule 
has  a  symmetry  number  a  ^  1,  and  the  lowest  state  of  rotation  has  an  a-priori 
probability  p  ^  I,  see  Chapters  12  and  13. 
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molecule,  and  the  second  term  is  the  excess  entropy  due  to  the  rise  of 
specific  heat  above  the  value  for  monatomic  gases. 

For  diatomic  gases  at  sufficiently  low  temperatures  the  excess  specific 
heat  {Cp  —  S/2R)  will  evidently  be  the  rotational  specific  heat,  so  that 
the  second  term  in  equation  (206)  can  be  written  in  the  following  form, 
by  introducing  equation  (200)  for  the  rotational  specific  heat, 

'S'Rot.  =  1  (  Cp  —  -xR  j-y-  =  I  CRot.  d  log  T 

0  0 

=  ]Ro^^^^dXogT  (207) 

0 

Introducing  the  relation  between  o  and  T  given  by  equation  (201), 
we  may  transform  the  above  expression  as  follows : 

T 

Sr=  —  I  Ro-  — -T-| —  rf  log  0 

.      ^  of  log  Q 

log  Q  —  o 


=  R     \osQ  —  o  rir;^^  (208) 

0 

Since  most  expressions  for  the  condition  sum  Q  give  simple  expressions 
for  log  Q  at  high  and  low  temperatures,  equation  (208)  is  often  an  easy 
one  to  apply.  This  method  of  treatment  is  due  to  Tolman  and  Badger.' 
The  most  accurate  test  of  equation  (208)  is  afiforded  by  the  case  of 
hydrogen  chloride  where  we  know  from  spectral  data  that  the  moment  of 
inertia  of  the  molecule  is  /  ^  2.594  X  10"*°  gm.  cm.^  and  also  know  that 
half  quantum  numbers  are  to  be  used.  Substituting  in  the  foregoing 
equations  we  obtain  for  the  theoretical  value  of  the  entropy  of  one  mol 
of  hydrogen  chloride  gas  at  25°  C.  and  1  atm.  pressure  the  value  43.2 
cal./deg.  as  compared  with  the  experimental  value  43.3. 

131.  The  above  methods  have  many  practical  applications  in  connec- 
tion with  a  number  of  other  gases.  Allowance  must  be  made  for  the 
entrance  of  vibrational  specific  heat  at  the  higher  temperatures,  and 
this  explains  the  reason  why  diatomic  molecules  have  specific  heats  above 
5/2  R  at  higher  temj^eratures.  It  may  also  be  noted  that  the  weaker  the 
binding  force  between  the  atoms  the  lower  the  temperature  will  be  at 
which  vibrational  energy  will  become  appreciable.  This  is  found  to  con- 
nect satisfactorily  the  chemical  facts  that  w^e  know  about  diatomic  mole- 
cules with  their  known  specific  heats. 

*  Tolman  and  Badger,  /.  Amer.  Chem.  Soc,  45,  2277  (1923). 


Chapter  lo 
The  Specific  Heat  of  Solids 

132.  Nature  of  the  Experimental  Facts.^ — We  shall  next  apply  the 
Maxwell-Boltzmann  distribution  law,  modified  so  as  to  take  account 
of  the  quantum  theory,  to  a  determination  of  the  specific  heat  of  solids. 
On  the  basis  of  the  classical  theory  we  were  led  to  the  conclusion  that  a 
solid  composed  of  A^  atoms  would  have  at  any  given  temperature  the 
heat  capacity  3Nk,  corresponding  to  an  equipartition  both  of  kinetic  and 
potential  energy  among  the  3N  degrees  of  freedom.  The  actual  experi- 
mental facts,  however,  show  that  although  the  heat  capacity  of  many 
solids  has  approximately  this  value  at  room  temperature,  it  falls  below 
SNk  at  lower  temperatures,  approaching  zero  indeed  at  the  absolute  zero 
of  temperature.  On  the  basis  of  our  new  information  as  to  the  quantiza- 
tion of  periodic  motions,  it  is  evident  that  the  principal  cause  of  this 
fall  below  the  classical  value  is  due  to  the  fact  that  the  atoms  in  a  solid 
must  themselves  be  performing  periodic  motions,  and  hence  can  only 
assume  certain  discrete  amounts  of  energy  as  allowed  by  the  quantum 
theory,  rather  than  the  continuous  succession  of  energies  allowed  by  the 
classical  theory.  We  shall  now  show  how  to  introduce  these  quantum 
restrictions,  following  substantially  the  method  of  Debye. 

133,  Modes  of  Vibration  of  an  Elastic  Solid. — Let  us  consider 
the  heat  capacity  of  an  isotropic  solid  composed  of  A^  equivalent  atoms. 
Since  the  thermal  energy  of  such  a  solid  will  be  due  to  the  periodic  oscilla- 
tion of  the  atoms  composing  it,  we  may  regard  the  solid  as  a  whole  as  a 
conditionally  periodic  system  of  3N  degrees  of   freedom. 

In  describing  the  periodic  motions  of  this  system  it  will  be  most  advan- 
tageous to  make  use  of  results  obtained  from  the  theory  of  elasticity  as 
to  the  periodic  motions  which  can  be  present  in  an  elastic  solid.  In 
accordance  with  elastic  theory,  the  internal  motions  of  an  elastic  solid 
can  be  analyzed  into  longitudinal  and  transverse  wave  motions  travelling 
with  the  velocities 

c,  =  Ji±lE      and       cr  =  j^  (209) 

1        Q  ^  Q 

where  X  and  /n  are  the  elastic  constants  of  the  solid  and  q  its  density.     In 
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the  case  of  a  body  of  finite  size  these  wave  motions  will  be  reflected 
at  the  boundaries  and  will  give  rise  to  sets  of  standing  waves  of  differ- 
ent frequencies.  Each  possible  way  in  which  such  a  standing  wave  can 
be  formed  is  called  a  mode  of  vibration  of  the  solid,  and  it  can  be  shown 
that  any  steady  state  of  internal  motion  for  the  solid  can  be  described  by 
specifying  the  amplitude  and  phase  for  each  mode  of  vibration. 

The  number  of  modes  of  vibration  lying  in  a  given  frequency  range 
will  evidently  depend  on  the  volume  of  the  solid  V ,  the  velocities  of  the 
longitudinal  and  transverse  wave  motions,  c^  and  Ct  and  the  frequency 
range  v  to  v  -\-  dv  under  consideration,  and  it  can  be  shown  that  the 
actual  number  of  modes  of  vibration  of  the  two  kinds  together  are  given 
by  the  equation 

,„       4nVv-dv    ,   SnVv-dv  /oir.\ 

^^  =  — 71 1 —3 —  (210) 

Cl  Ct 

the  first  term  giving  the  number  of  longitudinal  modes  of  vibration  and 
the  second,  the  number  of  transverse  modes,  which  is  twice  as  great 
owing  to  the  possibility  of  polarization  for  transverse  waves.  The  com- 
plete derivation  of  equation  (210)  is  somewhat  complicated,  but  except 
for  the  numerical  factors  47i  and  Sti  it  is  evident  that  the  expressions 
for  the  two  numbers  are  a  necessary  consequence  of  the  principle  of 
dimensional  homogeneity,  and  we  may  accept  the  complete  result  as 
"manna"  from  the  mathematician  without  more  ado.^ 

134.  Modes  of  Vibration  in  an  Actual  Solid  of  3N  Degrees  of 
Freedom. — The  theory  which  leads  to  equation  (210)  is  based  on  the 
assumption  of  a  continuous  solid,  corresponding  to  an  infinite  number  of 
degrees  of  freedom,  and  gives,  as  will  be  noted  from  the  equation,  an 
infinite  number  of  modes  of  vibration  provided  we  place  no  upper  limit 
on  the  frequencies  considered.  Our  actual  solid,  however,  has  only  3N 
degrees  of  freedom  and  hence  its  internal  motion  would  be  completely 
described  by  specifying  the  amplitudes  and  phases  for  3A^  modes  of 
vibration  instead  of  an  infinite  number.  This  leads  quite  naturally  to 
the  introduction  of  the  assumption  of  Debye  that  the  actual  internal  energy 
of  our  solid  can  be  satisfactorily  accounted  for  if  we  assume  it  to  be 
resident  in  3N  modes  of  vibration  whose  frequencies  are  distributed  in 
accordance  with  equation  (210),  running  from  zero  frequency  to  a 
maximum  frequency  v„i,  determined  by  the  requirement  that  the  total 
number  of  modes  of  vibration  is  to  be  3N. 

This  hypothesis  is  amply  justified  by  the  results  of  its  application. 
As  an  a-priori  justification,  moreover,  it  should  be  noted  that  the  dis- 

^  See  Jeans,  "Dynamical  Theory  of  Gases,"  Second  Edition,  "Chapter  XVII." 
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tribution  of  modes  of  vibration  with  frequencies  low  enough  so  that  the 
wave  lengths  are  long  compared  with  the  interatomic  distances  will 
obviously  be  practically  the  same  as  predicted  for  a  continuous  solid. 
And  indeed,  it  turns  out  that  even  the  maximum  frequency  usually  corre- 
sponds to  a  wave  length  which  is  fairly  long  compared  with  interatomic 
distances.  Hence  the  success  of  a  theory  built  on  these  lines  does  not 
seem  surprising. 

The  value  of  the  maximum  frequency  Vm  can  easily  be  determined 
with  the  help  of  equation  (210),  since  the  total  number  of  modes  of 
vibration  is  to  be  ZN.     We  obtain 


or  solving  for  the  maximum  frequency 


% 


(212) 


135.  Quantization  of  a  Harmonic  Oscillator. — In  the  foregoing 
treatment  we  have  given  the  number  of  modes  of  vibration  for  a 
solid  which  lie  in  any  given  frequency  range  v  \.o  v  -\-  dv.  Our  next 
interest  will  be  to  determine  the  average  energy  associated  with  a  mode 
of  vibration  of  a  given  frequency.  We  shall  then  be  in  a  position  to 
integrate  the  energy  for  all  the  frequencies  and  hence  determine  the  total 
energy  content  and  heat  capacity  of  our  solid. 

To  carry  out  this  program  we  must  first  obtain  the  quantum  restric- 
tions for  the  oscillating  motion  of  a  mode  of  vibration.  The  energy 
for  a  mode  of  vibration  or  for  any  harmonic  oscillator  of  one  degree 
of  freedom  is  given  in  terms  of  a  single  coordinate  q  and  momentum  p 
by  the  equation 

E  =  \aq^^\hp^  (213) 

where  a  and  h  are  constants,  and  the  quantum  condition  will  be  given  by 
the  Wilson-Sommerfeld  equation 


/ 


pdq  =  nh  (214) 


In  accordance  with  the  equations  of   motion   in  the   Hamiltonian    form, 
however,  we  may  write  for  dq 

dq  =  qdl  =  ^  df  =  bpdt  (215) 
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Substituting  in  (214),  we  obtain 

bp^dt  =  nh  (216) 


/ 


The  integral,  however,  is  to  be  taken  around  the  complete  period  of  the 
motion  t,  so  that  equation  (216)  can  be  rewritten  in  the  form 

The  period  t,  however,  is  the  reciprocal  of  the  frequency  v  and  the  average 
value  of  bp^  is  evidently  twice  the  average  potential  energy  of  the  oscillator, 
which  from  the  equality  of  the  average  kinetic  and  potential  energies  of 
the  oscillator,  is  equal  to  the  total  energy.     We  thus  obtain 

Ez=nhv  (217) 

for  the  possible  values  of  the  energy  of  any  quantized  harmonic  oscillator. 

136.  Average  Energy  of  a  Harmonic  Oscillator.— If  we  have  a 
system  containing  a  large  number  of  modes  of  vibration,  or  other  har- 
monic oscillators,  of  frequency  v,  we  may  determine  their  average  energy 
with  the  help  of  the  Maxwell-Boltzmann  distribution  law.^  For  the 
number  of  oscillators  in  a  given  quantum  state,  having  the  quantum 
number  n,  we  may  write 

N.  =  ^^^  (218) 


and  hence  for  the  average  energy 


Cflv.   


2^     kr 


—  (219) 


Substituting  the  value  nhv  for  the  energy  in  terms  of  the  quantum  num- 
ber n  as  given  by  equation  (217),  and  writing  out  the  summations  we 
obtain  ^ 

"It  should  be  noted  in  accordance  with  Chapter  3,  Section  30,  and  Chapter  4, 
that  the  Maxwell-Boltzmann  distribution  law  applies  just  as  reasonably  to  oscillators 
or  modes  of  vibration  as  to  those  components  of  a  system  which  are  more  commonly 
called  molecules. 

'This  substitution  assumes  that  the  quantum  number  n  has  the  successive 
values  0,  1,  2,  3  ....  The  newer  dynamics  mentioned  in  Chapter  7  would  give 
the  values  1/2,  3/2,  5/2  ...  . 

If  the  half  quantum  numbers  were  employed  the  average  energy  would  come  out 

hv         I    hv 


ekT 
thus  differing  from   (221)   by  a  constant  additive  term  which  need  not  concern  us. 
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_  hv\e      kT  -\-2e      kv  +  3g       fer  +  . .  J 

/  _iiL  —IhJL  —ihJL  \  K^^^) 

\\ -[- e      kT -\- e      kT  -\- e      kT  -\- . .  .) 


and  by  division  this  is  easily  shown  to  reduce  to 

hv 

Cav.  —      7^7 

e  kT  —  1 


(221) 


which  is  a  general  expression  for  the  average  energy  of  harmonic  oscil- 
lators of  frequency  v  at  the  temperature  T. 

137.  For  a  system  containing  N  harmonic  oscillators  the  total  energy 

would  evidently  be 

Nhv 
j,^_mn^  (222) 

e  kT  —  1 
and  the  heat  capacity  of  such  a  system  would  be 

yekT  —  1) 

If  the  atoms  of  a  solid  had  only  a  single  frequency,  instead  of  the  range 
of  frequencies  given  by  the  Debye  theory,  the  specific  heat  of  a  solid 
would  evidently  be  three  times  the  value  given  by  equation  (223).  This 
was  the  original  proposal  of  Einstein  which  led  to  the  more  complete 
treatment  of  Debye. 

138.  Energy  Content  and  Specific  Heat  of  a  Simple  Solid. — We 

may  now  combine  equation  (210)  for  the  number  of  modes  of  vibration 
in  a  given  frequency  range  with  equation  (221)  for  their  average  energy, 
and  by  integrating  over  all  possible  values  of  the  frequency  from  zero 
to  the  maximum  Vm,  obtain  the  following  expression  for  the  total  energy 
of  the  solid 

This   expression   can   be   simplified   if   we   introduce    equation    (212) 
for  the  maximum  frequency 

It        y 


\87iV  2c j}^ 


CtH 
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and  substitute  the  symbols 

~-^  =  e       and     Jf=x  (225) 

Assuming  ©  which  depends  on  the  elastic  properties  of  the  body  as  in- 
dependent of  the  temperature,  we  obtain  for  the  energy  content  and  heat 
capacity 


and 


£  =  9OTr(§)7;^rf.  (226) 

C,  =  9NkU{^y  Jl-^i'-'-Y-J^l  (227) 

139.  It  should  be  noted  in  accordance  with  these  equations  that  the 
energy  content  and  heat  capacity  of  the  simple  solids  under  considera- 
tion are  functions  of  a  single  parameter  O,  which  is  dependent  on  the 
elastic  properties  of  the  solid.  This  parameter  has  the  dimensions  of 
temperature  and  is  sometimes  called  the  characteristic  temperature  for  the 
substance.  For  purposes  of  representing  the  experimental  facts  the  value 
of  0  is  of  course  best  taken  from  actual  measurements  on  specific  heats. 
It  is,  however,  also  calculable,  from  the  elastic  properties  of  the  solid, 
and  in  accordance  with  equations  (212)  and  (225)  has  the  value 

in  terms  of  the  velocities  of  the  two  kinds  of  wave  motions.  These 
velocities,  however,  are  themselves  dependent  on  the  quantities  usually 
measured,  density  q,  compressibility  x,  and  Poisson's  ratio  o  in  such  a 
way  that  the  above  value  for  0  may  be  reexpressed  in  the  form 

^  ^  3.62  X  10-^ 


where  M  is  the  molecular  weight,  and  the  other  quantities  are  in  c.g.s.  units. 

140.  Approximations  at  Low  and  High  Temperatures. — The  gen- 
eral evaluation  of  the  definite  integral  occurring  in  equations  (226)  and 
(227)  is  not  possible  in  algebraic  form,  and  we  may  first  consider  the 
simple  approximations  which  hold  at  low  and  high  temperatures. 

When  T  is  small  the  upper  limit  of  the  definite  integral  may  evi- 
dently be  taken  as  infinity  instead  of  O/T.     With  this  limit  the  integral 
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is  known  to  have  the  value  n*/l5,  and  equations  (226),  and  (227)  assume 
the  form 

^  =      503  (230) 


and 


C.  =  ^^^^  (231) 


It  is  of  special  interest  to  note  that  at  low  temperatures  the  energy 
content  and  specific  heat  thus  become  proportional  respectively  to  the 
fourth  and  third  powers  of  the  temperature.  This  is  not  only  in  accordance 
with  the  actual  experimental  facts,  but  should  receive  special  attention, 
since  the  result  differs  from  that  obtained  in  earlier  less  complete  theories 
of  the  specific  heat  of  solids.  The  result,  however,  is  in  agreement  with 
considerations  based  on  the  principle  of  similitude.* 

Attention  is  also  called  to  the  fact  that  the  approximations  valid  at 
low  temperatures  are  in  reality  independent  of  the  assumption  that  the 
solid  has  a  discontinuous  atomic  structure.  This  will  be  seen  by  sub- 
stituting into  equations  (230)  and  (231)  the  value  given  for  &  by  equa- 
tion (228)  and  noting  that  the  number  of  atoms  N  disappears  from  the 
expressions.  The  possibility  of  treating  the  system  at  low  temperatures 
as  a  continuous  solid  is  due  to  the  fact  that  all  the  periodic  motions,  which 
are  of  low  enough  frequency  to  have  appreciable  energy  at  low  tempera- 
tures, have  a  wave  length  so  long  that  the  discontinuities  do  not  enter 
into  consideration. 

141.  At  high  temperatures,  noting  the  definition  of  x  given  by  equa- 
tion (225),  we  can  introduce  the  approximation 

e^=lJ^x  (232) 

Equations   (226)   and   (227)   then  reduce  to  the  classical  expressions  in 
accordance  with  the  law  of  Dulong  and  Petit 

E  =  3RT  (233) 

C^  =  SR  (234) 

a  result  which  is  of  course  to  be  expected. 

142.  Values  at  Intermediate  Temperatures. — At  intermediate  tem- 
peratures, values  for  the  integral  occurring  in  equations  (226)  and  (227) 
have  been  computed  by  methods  of  approximation.  We  give  below  a 
table  ^  in  which  the  heat  capacity  per  mol  in  accordance  with  the  Debye 

*See  Tolman.  Phys.  Rev.,  4,  145   (1914). 

"Taken  in  part  from  Lewis  and  Randall,  "Thermodynamics,"  McGraw-Hill,  1923. 


140-143] 


SPECIFIC  HEAT  OF  SOLIDS 


117 


theory  is  given  as  a  function  of  T/0.  The  table  also  includes  values  which 
will  be  found  useful  for  the  total  energy,  £„  and  entropy  Sv  which  the 
solid  acquires  on  heating  up  at  constant  volume  from  the  absolute  zero. 
These  quantities  are  defined  by  the  equations 


o*,<4 

Er        r^   dT 
&    ~  j^^   0 

0 

(235) 

ana 

s  -Ic  '^'^ 

O  V    1       ^  V        rp 

0 

(236) 

TABLE  I 

7/0 

logio  T/Q 

Cv 

Ev/& 

Sv 

.251 

-0.600 

0.11 

0.01 

0.04 

.316 

-0.500 

0.21 

0.02 

0.07 

.398 

-GAOO 

0.43 

0.04 

0.15 

.501 

-0.300 

0.84 

0.11 

0.30 

.631 

-0.200 

1.43 

0.26 

0.53 

.794 

-0.100 

2.19 

0.57 

0.98 

1.000 

0.000 

2.98 

1.09 

1.62 

1.259 

0.100 

3.78 

1.96 

2.41 

1.585 

0.200 

4.43 

ZM 

i.Z7 

1.995 

0.300 

4.91 

5.32 

4A2 

2.51 

0.400 

5.26 

7.83 

5.59 

3.16 

0.500 

5.49 

11.37 

6.82 

3.98 

0.600 

5.65 

15.94 

8.08 

5.01 

0.700 

5.72 

21.77 

9.41 

6.31 

0.800 

5.76 

29.28 

10.74 

7.94 

0.900 

5.78 

38.66 

12.07 

10.00 

1.000 

5.79 

50.57 

13.40 

12.59 

1.100 

5.80 

65.58 

14.74 

15.85 

1.200 

5.81 

84.50 

16.08 

143.  Comparison  with  Experimental  Data. — We  may  now  con- 
sider the  correspondence  between  the  Debye  theory  and  the  experimental 
facts.  In  actually  making  use  of  the  Debye  formulae  for  energy  content 
and  specific  heat,  it  is  customary  to  employ  values  of  the  characteristic 
temperature  0  for  dififerent  solids  which  have  been  obtained  from  the 
specific  heat  data  themselves,  rather  than  calculated  from  the  elastic 
properties  of  the  material.  This  of  course  leads  to  a  better  fit  between 
experimental  data  and  observations  than  would  otherwise  be  obtained 
and  increases  the  usefulness  of  the  Debye  formula  as  a  practical  method 
of  correlating  specific  heat  data.  The  improvement  in  fit  thus  obtained 
may  be  partially  due  to  uncertainty  as  the  proper  values  of  the  elastic 
constants.  In  addition,  however,  the  evil  eflfects  of  basing  the  deriva- 
tion on  an  assumed  homogeneity  and  isotropy  of  the  solids  may  be  partially 
eliminated  by  using  values  of  0  which  give  the  best  fit  to  the  curve. 
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Using  the  values  of  0  which  do  give  the  best  fit,  it  is  found  that 
there  are  a  considerable  number  of  substances  having  specific  heat  curves 
which  agree  almost  exactly  with  the  Debye  formula.  Thus  for  aluminum, 
copper,  lead,  diamond,  silver,  mercury,  thallium,  and  zinc  the  agreement 
is  practically  within  the  small  experimental  error  in  the  measurements  of 
specific  heat.* 

In  the  case  of  solids  which  do  obey  the  Debye  equation,  it  should 
especially  be  noted  that  a  single  measurement  of  specific  heat  at  a  tem- 
perature not  too  far  removed  from  that  where  Cv  has  fallen  to  half  the 
classical  value  will  be  sufficient  to  determine  the  whole  specific  heat  curve, 
as  well  as  the  entropy  and  heat  content  at  all  temperatures.  This  is  a 
fact  of  extreme  practical  importance  in  calculations  of  the  free  energy  of 
solid  substances. 

144.  Comparison  of  Thermal  and  Elastic  Values  of  0. — Not  only 
does  the  Debye  theory  give  a  very  precise  reproduction  of  the  specific 
heat  curve  for  a  considerable  number  of  solids  when  the  value  of  &  is 
chosen  so  as  to  give  the  best  fit,  but  the  values  of  0  thus  obtained  are  in 
very  reasonable  agreement  with  those  calculated  from  the  elastic  prop- 
erties, in  accordance  with  equation  (229).  This  is  shown  by  the  following 
table.'^ 

TABLE  II 

Solid  0  Thermal  0  Elastic 

Aluminum    396  402 

Copper    309  332 

Silver    215  214 

Lead    95  7Z 

In  passing,  attention  is  called  to  the  fact  that  the  light  and  resilient 
metals  have  large  values  of  6>  and  the  dense  and  yielding  ones  have  small 
values.  This  is  a  qualitative  expression  of  the  facts  expressed  by  equa- 
tion (229).  Since  large  values  of  &  mean  that  the  temperature  has  to 
become  high  before  the  specific  heat  is  freed  from  quantum  restrictions, 
this  agrees  with  the  high  frequencies  of  oscillation  that  we  should  expect 

*  As  a  matter  of  fact  the  values  of  Cv  in  Table  I  are  those  obtained  by  Lewis 
and  Gibson,  /.  Amer.  Cheni.  Soc,  39,  2554,  1917,  from  a  curve  representing  actual 
experimental  data.  For  other  tables  see  Nernst,  "Grundlagen  des  neuen  Warme- 
satzes,"  Second  Edition,  Kjiapp,  Halle,  1924. 

'  Taken  from  Born,  "Atomtheorie  des  f esten  Zustandes,"  Teubner,  Leipzig,  1923. 
The  elastic  constants  used  in  calculating  the  values  of  0  v^^ere  obtained  by 
GriJneisen  {Ann.  d.  Phys.,  22,  838,  1907;  25,  845,  1908),  and  are  the  values  at 
room  temperature.  At  lower  temperatures  quite  different  values  are  obtained.  It 
is  believed,  however,  that  these  changes  are  due  to  the  amorphous  substance  be- 
tween the  small  crystals  composing  the  metals  rather  than  to  a  variation  in  the 
behavior  in  the  crystals  themselves.  See  "Die  Theorie  der  Strahlung  und  der 
Quanten,"  1,  Solvay-Kongress,  1911    (Halle,  1914)  ;  Appendix  by  Eucken,  p.  387. 
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for  a  light  and  resilient  substance,  and  the  high  temperatures  that  must 
be  attained  before  light  and  resilient  crystals  like  the  diamond  obey  the 
law  of  Dulong  and  Petit. 

145.  Extensions  of  Debye  Theory. — The  chief  approximations  in 
the  Debye  theory  have  arisen  from  the  treatment  of  a  soHd  as  a  con- 
tinuous, homogeneous,  isotropic  structure  rather  than  as  the  actual  space 
lattice  which  we  believe  exists.  Treatments  based  more  closely  on  the 
theory  of  space  lattices  have  been  developed  by  Born,  Karman  and 
Thirring.^  Nevertheless,  such  treatments  are  complicated  and  although 
of  great  theoretical  importance,  the  result  for  practical  purposes  is  largely 
merely  to  alter  the  nature  of  the  approximations  which  must  be  intro- 
duced before  we  can  apply  the  conclusions  to  actual  experimental  results. 
There  are  two  cases,  however,  in  which  extensions  of  the  Debye  theory 
can  be  made  which  are  quite  simple  and  of  considerable  practical 
importance. 

146.  In  the  case  of  solid  compounds,  Nernst  has  suggested  the  pos- 
sibility of  dividing  the  internal  energy  into  two  portions  corresponding 
to  the  vibration  of  the  compound  molecule  as  a  whole,  and  the  mutual 
vibration  of  the  atoms  inside  the  molecule.  For  the  energy  belonging 
to  the  molecules  as  a  whole  we  can  use  the  Debye  expression  as  given 
by  equation  (224)  based  on  the  presence  of  all  possible  frequencies  from 
zero  to  the  maximum  Vm,  and  for  the  energy  of  the  atoms  inside  the 
molecule  can  use  the  Einstein  expression  as  given  by  equation  (222) 
substituting  the  actual  frequencies  of  oscillation  of  the  atoms  in  the  mole- 
cule.   We  can  thus  obtain  for  the  total  energy  an  expression  of  the  form 


E=9mr^^  +  ^^^^  (237) 

'^m     J   gkx 1  *    c  kT  1 

o 

where  the  summation  2  in  the  last  term  is  to  be  carried  out  for  all  the 
internal  vibrations  present  inside  the  molecule.  For  a  diatomic  molecule 
this  reduces  to  a  single  term. 

The  treatment  is  of  course  again  only  a  partial  approximation,  since 
the  absolute  division  of  the  internal  energy  into  the  two  kinds  is  an  over 
simplification,  and  the  X-ray  analysis  of  the  crystalline  substances  to  which 
the  formula  may  be  successfully  applied  usually  do  not  give  evidence  of 
separate  molecules  inside  the  crystal.  Nevertheless,  as  a  matter  of  fact, 
Nernst  has  shown  that  equation  (237)  gives  an  excellent  representation 
of  the  energy  content  for  a  number  of  substances.     In  applying  the  equa- 

*  See  Born,  previous  reference. 
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tion  to  a  number  of  crystalline  salts,  moreover,  he  has  obtained  good 
agreement  using  a  value  of  Vm  calculated  from  the  elastic  properties  of  the 
crystal,  and  a  value  of  vi  which  is  the  frequency  of  the  infra  red  absorp- 
tion band  (Reststrahlen)  as  measured  by  Rubens  for  the  salt.  This  latter 
identification  seems  reasonable,  since  we  should  expect  such  optical  effects 
to  result  from  the  relative  vibration  of  the  ions. 

The  following  table  taken  from  Nernst  ^  shows  the  close  agreement 
between  observed  and  calculated  values  of  the  heat  capacity  per  mol  of 
potassium  chloride. 

TABLE  III 
Heat  Capacity  per  Mol  for  KCI 

T 

degrees 
absolute 

22.8 

26.9 

30.1 

33.7 

48.3 

57.6 

70.0 

86.0 
235 
416 
550 

147.  Another  simple  and  important  extension  of  the  Debye  theory  is 
due  to  the  work  of  Lewis,  Eastman  and  Rodebush  ^^  on  the  specific  heat 
of  strongly  electropositive  metals,  in  which  even  at  low  temperatures  the 
specific  heat  rises  higher  than  would  accord  with  the  Debye  formula  and 
indeed  at  moderate  temperatures  may  exceed  the  classical  values  of  Dulong 
and  Petit.  The  explanation  of  this  departure  is  of  course  that  the  "free" 
electrons  which  account  for  the  high  conductivity  of  such  strongly  metallic 
substances  have  acquired  a  very  considerable  thermal  energy  even  at 
moderate  temperatures.  Further  quantitative  theoretical  study  of  the  form 
of  these  specific  heat  curves  would  be  desirable. 

•Wolfskehl  Kongress  Gottingen,  1914  (Leipzig-Berlin,  1914),  4,  Vortrag,  W. 
Nernst,  Kinetische  Theorie  Fester  Korper,  p.  81. 

"Lewis,  Eastman  and  Rodebush,  Proc.  Nat.  Acad.  Sci.,  4,  25   (1918). 
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11.78 
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Chapter  ii. 

The  Distribution  of  Radiation  at  Thermal 
Equilibrium. 

148.  Method  of  Treatment.— We  shall  next  apply  the  Maxwell- 
Boltzmann  law  modified  to  allow  for  quantum  theory  to  a  determina- 
tion of  the  distribution  with  respect  to  frequency  of  the  radiation  present 
in  an  enclosure  or  hohlraum  which  has  come  to  temperature  equilibrium 
with  the  walls.  This  problem  has  in  addition  to  its  great  theoretical  and 
practical  importance,  the  historical  interest  of  having  led  to  the  first 
introduction  of  the  quantum  theory  by  Planck,  in  his  derivation  of  the 
famous  Planck  radiation  law. 

For  the  purpose  of  this  chapter,  we  shall  regard  radiation  after  it 
has  once  been  emitted  as  a  wave  motion  propagated  in  accordance  with 
the  classical  laws  of  the  Maxwell  electromagnetic  theory  of  light,  and 
hence  shall  be  able  to  assign  the  energy  present  in  a  hohlraum  to  the 
different  modes  of  electromagnetic  vibration  which  are  present.  The 
quantum  theory  in  this  treatment  will  be  introduced  by  quantizing  the 
motion  of  these  modes  of  vibration,  as  in  the  derivation  of  equation  (217) 
in  the  last  chapter,  and  thus  permitting  them  to  assume  only  such  quanti- 
ties of  energy  as  are  multiples  of  hv.  It  will  be  found  that  this  treat- 
ment leads  to  the  Planck  radiation  law. 

In  a  later  chapter  we  shall  present  another  mode  of  treatment  due 
to  Einstein,  which  is  based  on  considerations  as  to  the  mechanism  of 
the  absorption  and  emission  of  radiation  by  the  material  of  the  walls 
which  are  in  thermal  equilibrium  with  the  radiation  in  the  hohlraum. 
In  this  later  treatment  it  will  not  be  necessary  to  assume  that  the  radia- 
tion consists  of  wave  motions,  and  indeed  the  treatment  on  the  whole 
adopts  at  least  a  friendly  attitude  towards  the  extreme  position  of  the 
existence  of  light  corpuscles.  The  quantum  theory  in  this  later  mode 
of  approach  will  be  introduced  by  assuming  that  the  emission  and  absorp- 
tion of  radiation  result  from  changes  in  the  quantum  states  of  atoms 
or  molecules  in  thermal  equilibrium  with  the  radiation.  The  final  result 
will  be  the  Planck  radiation  law  just  as  in  the  treatment  we  are  now 
going  to  undertake. 

121 
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149.  Modes  of  Vibration  in  the  Hohlraum. — For  the  number  of 
modes  of  electromagnetic  vibration  in  a  hohlraum  of  volume  V,  with 
frequencies  betw^een  v  and  v  -\-  dv  ysie  may  write  the  expression 

dZ  =  ^  v^dv  (238) 

where  c  is  the  velocity  of  light.  The  expression  is  analogous  to  that 
given  in  the  last  chapter  for  the  number  of  modes  of  transverse  vibration 
in  an  elastic  solid  and  the  derivation  is  similar.  Except  for  the  mag- 
nitude of  the  numerical  constant  Sn,  the  equation  is  evidently  completely 
derivable  from  the  principle  of  dimensional  homogeneity. 

150.  Average  Energy  of  a  Mode  of  Vibration. — Each  mode  of 
vibration  is  a  sub-system  of  one  degree  of  freedom,  its  state  being  com- 
pletely describable  by  the  specification  of  the  amplitude  and  phase  of 
the  vibration,  or  by  the  specification  of  the  value  of  a  single  coordinate 
and  the  corresponding  momentum.  Since  the  coordinate  performs  a  har- 
monic oscillation,  the  motion  can  be  quantized  in  the  same  way  as  for 
any  harmonic  oscillator  and  the  mode  of  vibration  can  only  assume  values 
of  the  energy  which  are  multiples  of  hv,  in  accordance  with  the  equation 

e  =  nhv  (239) 

derived  in  the  last  chapter.  Furthermore  the  average  energy  for  a  mode 
of  vibration  will  be 

£av.  =  -^^-  (240) 

e  kT  —  1 
as  given  by  equation  (221)  in  the  last  chapter.^ 

151.  Planck  Distribution  Law. — Combining  equations  (238)  and 
(240)  for  the  number  of  modes  of  vibration  in  a  given  frequency  range 
and  their  average  energy  and  dividing  by  the  volume  V ,  we  obtain 

,u=^J}^-^ir  (241) 

ekT —  1 

for  the  density  of  energy  in  the  frequency  range  dv  when  the  hohlraum 
has  come  to  thermal  equilibrium. 

This  is  the  Planck  radiation  law  which  agrees  with  the  experimental 
facts  within  the  limits  of  observational  error. 

In  the  region  of  low  frequencies  and  at  high  temperatures,  the  quantity 

^  For  the  result  obtained  using  half  quantum  numbers,  see  footnote  to  Section  136. 
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hv 

e''^      can  be  approximated  by  the  first  two  terms  of  its  expansion  and 
equation  (241)  rewritten  in  the  form 

du  =  ^^kTdv  (242) 

which  is  the  classical  Rayleigh-Jeans  formula  deducible  from  the  principle 
of  equipartition  of  energy. 

In  the  region  of  high  frequencies,  however,  equation  (241)  can  evi- 
dently be  rewritten  in  the  form 

d>7ihv^      —'2IL,  ,^,^, 

du  =  — ^— e     kT  dv  (243) 

which  is  the  so-called  Wien  equation.  ^ 

152.  Total  Energy  in  the  Hohlraum. — To  obtain  the  total  density 
of  energy  in  the  hohlraum,  we  may  integrate  equation  (241)  for  all  fre- 
quencies from  zero  to  infinity  and  obtain 

W  =     I       —^ 1:7, dv  =  —r  \„  ^        I       — :r  dx 

e  k'J  —  i 
»^^    74  (244) 
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which  is  the  familiar  Stefan-Boltzmann  law  for  the  total  density  of 
energy  in  the  hohlraum.  This  form  of  this  law  is  of  course  derivable 
from  thermodynamics  but  the  magnitude  of   Stefan's  constant 

"  =  i^  (245) 

was  not  theoretically  determinable  until  the  introduction  of  the  quantum 
theory.  The  value  given  by  equation  (245)  is  in  agreement  with  the 
experimental  results,  within  the  observational  error. 

Equations  (241)  and  (244)  have  a  wide  range  of  theoretical  applica- 
tion and  in  addition  well-known  practical  applications  in  the  measure- 
ment of  high  temperatures. 


Chapter  12 

General  Determination  of  the  Most  Probable  State 

of  a  System 

153.  Introduction. — In  the  foregoing  applications  of  statistical 
mechanics,  we  first  derived  the  Maxwell-Boltzmann  law  for  the  distribu- 
tion of  atoms  and  molecules  in  different  states  and  then  applied  this  law 
to  the  problem  of  interest.  We  are  next  going  to  consider  two  problems — 
the  vapor  pressure  of  solids  and  liquids  at  low  temperatures,  and  chemical 
equilibria  in  gases  at  high  temperatures — in  which  the  energy  connected 
with  an  atom  or  molecule  is  very  dependent  on  the  presence  of  neighbor- 
ing atoms  or  molecules  with  which  it  may  be  chemically  combined  or 
associated  in  the  solid  or  liquid  form.  This  dependence  of  the  energy 
in  such  a  far  reaching  way  on  the  coordinates  of  more  than  one  atom  or 
molecule,  makes  it  difficult  to  solve  the  problems  by  applying  the  Maxwell- 
Boltzmann  distribution  law,  since,  as  we  saw  in  the  original  derivation, 
the  quantity  e  occurring  in  the  Maxwell-Boltzmann  expression  is  the  rate 
of  change  in  the  energ}^  of  the  system  per  molecule  introduced  into  a 
given  region  of  the  phase  space  when  the  system  is  in  its  state  of  maximum 
probability,  and  in  the  case  of  the  systems  now  to  be  considered  we  could 
no  longer  make  the  simplification  of  taking  £  as  the  individual  energy 
of  a  single  molecule. 

This  makes  it  desirable  for  our  present  purposes  to  return  to  the 
use  of  the  same  kind  of  considerations  as  to  the  probabilities  of  differ- 
ent states  which  were  employed  in  deriving  the  Maxwell-Boltzmann 
distribution  law  itself.  For  this  purpose,  however,  we  must  first 
reexamine  some  of  our  fundamental  methods  since  we  must  now  take 
account  of  the  quantum  theory  and  allow  for  the  possibility  of  quantized 
states  in  a  way  not  contemplated  in  the  original  derivation  of  the  Maxwell- 
Boltzmann  law. 

154.  The  Microcanonical  Ensemble. — In  Chapter  3  we  made  a 
study  of  the  behavior  of  ensembles  of  systems  of  identical  structure, 
distributed  uniformly  in  the  QP  phase  space  or  y-space  with  total  energies 
between  E  and  E  -\-  dE.  Such  collections  were  called  microcanonical 
ensembles,  and  prove  to  be  very  useful  in  predicting  the  behavior  of 
single  systems  of  the  same  structure  as  those  in  the  ensemble.     In  the 
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first  place,  since  the  infinitesimal  dE  may  be  chosen  as  small  as  desired, 
all  the  systems  of  the  ensemble  may  be  regarded  as  having  substantially 
the  same  energy  E.  In  the  second  place  it  was  shown  with  the  help  of 
Liouville's  theorem  that  the  systems  composing  a  microcanonical  ensemble 
will  move  in  such  a  way  as  to  maintain  the  uniform  distribution  of  their 
representative  points  in  the  7-space.  Hence  if  we  accept  some  form  of 
the  ergodic  hypothesis  the  following  propositions  will  hold : 

1.  Setting  up  a  microcanonical  ensemble  of  systems  with  energies 
between  E  and  E  -f-  dE  and  then  selecting  a  system  from  the  ensemble 
at  random  is  equivalent  to  starting  off  a  single  system  with  the  energy  E 
and  then  taking  it  in  the  phase  it  happens  to  be  in  at  some  random  time 
after  starting. 

2.  The  probability  that  a  system  will  be  found  at  any  instant  in  a 
given  state  will  be  proportional  to  the  /-weight  of  the  state,  that  is  to 
the  volume  in  the  phase  space  belonging  to  those  systems  in  the  corre- 
sponding microcanonical  ensemble  which  are  in  the  state  in  question. 

These  propositions  make  it  possible,  as  we  have  already  seen,  to  use 
microcanonical  ensembles  in  predicting  the  most  probable  states  of  systems. 

155.  Fully  Excited,  Partially  Excited,  and  Frozen-In  Degrees  of 
Freedom. — The  above  considerations  as  to  the  relation  between  prob- 
ability and  volume  in  the  phase  space  apply  in  the  first  instance  to  sys- 
tems that  obey  the  classical  laws  of  dynamics  which  permit  the  representa- 
tive point  for  a  system  to  assume  a  continuous  range  of  positions  in  the 
y-space.  Under  these  circumstances  the  volume  in  the  phase-space  or 
7-weight  belonging  to  a  given  state  of  the  system  is  evidently  the  total 
volume  which  can  actually  be  occupied  by  the  representative  point. 

In  determining  the  y- weight,  however,  for  systems  where  some  of  the 
degrees  of  freedom  may  be  affected  by  quantum  restrictions,  we  find  it 
desirable  to  distinguish  three  dift'erent  types  of  degrees  of  freedom : — 
1.  Fully  excited  degrees  of  freedom  such  that  the  corresponding  co- 
ordinates and  momenta  can  be  regarded  as  governed  by  the  classical  laws 
of  motion.  2.  Partially  excited  degrees  of  freedom  such  that  the  corre- 
sponding coordinates  and  momenta  can  only  assume  values  belonging  to 
particular  quantized  states.  3.  Frozen-in  degrees  of  freedom  such  that 
the  corresponding  coordinates  and  momenta  are  in  the  lowest  possible 
quantum  state. 

In  the  calculation  of  the  /-weight  for  a  complex  system  of  the  above 
nature,  we  assign  to  the  fully  excited  degrees  of  freedom  the  continuous 
hyper- volume  which  can  be  occupied  by  the  phase  point  of  the  system 
when  the  system  is  in  the  state  of  interest.  In  the  case  of  the  other 
degrees  of  freedom,  however,  which  are  affected  by  quantum  restrictions, 
we  must  assign,  in  accordance  with  the  considerations  of  Chapter  8,  to 


126  STATISTICAL  MECHANICS  [Ch.  12 

each  quantum  hyper-surface  the  volume  in  the  /-spaces  which  Hes  between 
it  and  the  next  higher  quantum  hyper-surface,  thus  introducing  a  factor  h 
for  each  non-degenerate  degree  of  freedom  involved. 

156.  The  Possibility  of  Gas  Degeneration. — In  applying  the 
methods  of  this  chapter  to  gases,  we  shall  assume  that  the  three  transla- 
tional  degrees  of  freedom  for  each  molecule  of  the  gas  can  be  treated 
as  fully  excited  and  not  subject  to  quantum  restrictions.  This  agrees 
with  the  considerations  of  Chapter  7,  Section  106,  where  it  was  shown 
that  quantum  restrictions  became  negligible  when  the  periods  of  motion 
are  long  or  many  quanta  of  energy  are  already  present. 

The  possibility  certainly  arises,  however,  that  at  high  concentrations 
when  the  space  available  for  the  motion  of  the  molecules  becomes  small, 
and  at  low  temperatures  where  the  average  energy  of  the  molecules  is 
small,  that  quantum  restrictions  will  also  affect  the  translational  motion 
of  the  molecules.  If  this  should  occur  we  should  have  a  phenomenon 
which  has  been  given,  in  advance  of  its  elucidation  and  discovery,  the 
name  of  gas  degeneration. 

The  problem  of  possible  gas  degeneration  has  already  been  the  subject 
of  much  speculation,  but  unequivocally  acceptable  results  have  not  yet 
been  obtained  owing  to  doubts  as  to  the  proper  method  of  quantizing  the 
translational  motions  of  molecules.  In  any  case  it  can  be  remarked  that 
all  the  tentative  results  which  have  so  far  been  obtained  agree  in  making 
appreciable  degeneration  occur  only  as  the  result  of  extreme  conditions 
which  are  not  met  in  ordinary  experimental  work. 

Hence  we  shall  have  no  hesitation  in  treating  gases  in  the  proposed 
manner  and  accepting  the  results  as  valid  for  ordinary  purposes.  Thus 
for  example,  the  Sackur-Tetrode  formula  for  the  entropy  of  gases,  which 
will  be  derived  in  the  next  chapter,  can  be  regarded  as  valid  for  all  the 
purposes  to  which  it  is  usually  put. 

157.  Method  of  Determining  the  /-Weight. — We  now  have  the 
principles  which  are  to  be  used  in  computing  the  y-weight  of  a  state  of 
our  system.  In  the  actual  determination  of  /-weight  we  find  it  con- 
venient to  proceed  as  follows.^ 

Consider  a  system  composed,  for  example,  of  X  -\-  Y  -\-  Z  . . .  .  atoms 
of  different  kinds  and  assume  that  three  degrees  of  freedom  are  sufhcient 
for  each  atom.^  Then  choose  some  particular  exemplar  of  the  state  of 
interest,  with  the  representative  point  correspondingly  located  in  the 
y-space  of  6  (X-fF-fZ-f....)  dimensions. 

*The  method  is  that  of  Ehrenfest  and  Trkal,  Proc.  Amsterdam  Acad.,  23,  162 
(1920). 

"Ultimately  perhaps  3  degrees  of  freedom  for  each  proton  and  electron. 


155-157]  PROBABILITY  OF  STATES  127 

a.  Generation  of  Region  (71).— First  now,  without  making  any 
changes  in  the  combination  of  the  atoms  into  molecules  or  other  aggre- 
gates, move  the  representative  point  through  all  possible  positions  of  the 
y-space,  lying  in  the  energy  range  E  to  E  -{-  dE,  which  correspond  to 
the  state  in  question.  This  motion  of  the  point  will  give  the  molecules 
all  positions,  orientations,  and  velocities  consistent  with  the  state  of  in- 
terest, leaving  the  combination  of  the  atoms  unchanged,  and  will  describe 
a  region  in  the  y-space,  of  a  given  shape  and  volume,  say  71.  The  region 
will  of  course  be  partly  generated  by  continuous  motions  for  fully  ex- 
cited degrees  of  freedom  and  partly  arise  from  the  assignments  of  volume 
due  to  quantized  degrees  of  freedom,  but  this  need  cause  no  confusion.^ 

b.  Permutation  of  Atoms. — Second,  having  thus  generated  one  region 
in  the  y-space,  of  weight  yu  which  corresponds  to  the  state  of  interest, 
we  may  evidently  now  obtain  further  such  regions  by  permutations  of 
similar  atoms  with  each  other.  This  will  lead  to  further  regions  of  the 
same  volume  and  shape  as  before,  but  differently  located  owing  to  changes 
in  axes  accompanying  the  permutation  of  atoms.  The  total  /-weight 
of  the  state  will  then  evidently  be  the  product  of  the  weight  /i  for  one 
of  these  regions  by  the  total  number  of  such  regions  obtained  by  the 
permutations. 

To  calculate  the  number  of  these  regions,  we  note  that  the  total 
number  of  possible  permutations  of  similar  atoms  will  evidently  be 
{\X  \Y  \Z  . . .  .)  where  X,  Y,  Z  .  .  .  .  are  the  numbers  of  atoms  of  each 
of  the  various  kinds.  The  result  of  some  of  these  possible  permutations, 
however,  would  evidently  be  the  same  as  already  reached  by  the  motion 
of  the  representative  point  in  generating  the  region  of  weight  71.  Thus  if 
all  the  atoms  of  two  similar  molecules  are  interchanged  with  each  other, 
we  reach  a  new  phase  which  can  also  be  attained  by  a  motion  of  the 
representative  point  without  leaving  the  original  region  of  weight  71. 
Furthermore,  molecules  may  contain  similar  atoms  symmetrically  placed  so 
that  a  rotation  of  the  molecule  will  lead  to  the  same  result  as  a  permuta- 
tion of  atoms.  Such  permutations  may  be  called  internal  and  evidently 
must  not  be  counted  in  getting  the  total  7-weight. 

c.  Expression  for  y-weight. — We  may  now  evidently  write  as  an  ex- 
pression for  the  total  7-weight 

-  1^  iZlj; ,246) 

^  ~  |A[x    |A^   iTVa 0,^,  0,^,  03^3 . . . .  ^'  ^ 

where  71  is  the  weight  obtained  by  motions  of  the  representative  point 
which  do  not  involve  permutation  of  the  atoms,  and  the  rest  of  the  ex- 

^  In  general  we  shall  assign  the  weight  h  to  each  quantized  non-degenerate  degree 
of  freedom,  whether  partially  excited  or  frozen-in.  See,  however,  the  remarks  in  the 
last  two  paragraphs  at  the  end  of  the  next  chapter. 
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pression  is  the  number  of  new  similar  regions  which  can  be  obtained  by 
the  permutation  of  atoms,  X,  Y,  Z  .  .  .  .  being  the  number  of  atoms  of  each 
of  the  different  kinds,  A''i,  N^,  N3 .  .  .  .  the  number  of  molecules  of  each 

of  the  different  kinds,  and  Oi,  02,  03 the  symmetry  factors  for  the 

different  kinds  of  molecules  which  give  the  number  of  equivalent  rotational 
orientations  of  the  molecule  in  question. 

158.  Volume  of  Hypersphere  and  Hyperellipsoid. — In  calculating 
the  weight  of  a  region  generated  by  the  motion  of  a  representative  point 
for  our  system,  it  often  becomes  necessary  to  know  the  value  of  certain 
definite  integrals  which  may  be  regarded  as  the  volumes  of  a  hypersphere 
or  hyperellipsoid  in  a  space  of  n  dimensions.* 

Consider  the  hyper-volume  in  a  space  of  n  dimensions  included  inside 
a  surface  which  is  defined  by  the  equation 

X,'  +  X,'  +  ....  +  Xn.,^  +  Xn^  =  T^  (247) 

This  surface  may  be  regarded  as  a  hypersphere  with  the  volume  defined 
by  the  equation 

y=  f  f f  j'dxr(hr2  ....  dx^.^dxn  (248) 

where  the  integration  is  to  be  taken  for  all  values  of  Xi  . . .  .  Xn  such  that 
x,'+x,'-\- vn-,'  +  -t-n'  <  r'  (249) 

Introducing  limits  that  correspond  to  integration  over  positive  values 
of  the  variable  and  allowing  for  this  by  multiplying  by  the  proper  powei 
of  two,  we  may  rewrite  equation   (248)  in  the  form 

F  =  2"  f'  f'^''^— "^ fVr'-.,-....-.^  rv/,"-..=  -....-.v  ^^^^^^  _  _   _  j^.^^j^.^  (250) 

J  0  J  0  Jo  Jo 

The  evaluation  of  this  integral  can  be  obtained  in  the  following  simple 
manner.  Let  us  denote  the  volume  of  a  sphere  of  n  dimensions  and 
radius  r  by  the  symbol  F„(r).  Then  it  is  evident  from  equation  (250) 
that  we  may  connect  the  volume  of  a  sphere  of  n  dimensions  with  that 
of  one  of  W' —  1  dimensions  and  radius  \/r~  —  x'^  by  the  relation 

r 

„(r)  =2J'Vn.^(Vr'  —  x^)dx  (251) 


0 

or  substituting  for  simplification 

.r  ^  r  sin 

*See    Schoute,    "Mehrdimensionale    Geomctrie,"    Part    II,    pag-e    288,    Sammlung 
Schubert,  Leipzig,  1905. 


(252) 
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we  may  write 

IT 

Vn{r)  =2ry  Fn-i(r  cos</.)    cos<^  d4>  (253) 

Making  use  of  our  knowledge  of  the  volume  of  spheres  of  low  dimen- 
sions, we  see,  moreover,  that  the  solution  of  equation  (253)  must  be  of 
the  form 

Vn{r)=anr^  (254) 

where  a^  is  a  function  merely  of  n,  and  by  comparison  with  (253)  can 
write 

IT 

an  =  2an.ij\cos  cf>)^dcf>  (255) 

The  integral,  however,  has  known  values,^  which  permit  us  to  write  equa- 
tion (255)  in  the  forms 


o2.4.6 (n-l)  .     .       ... 

an=-2-r—^—p a».i  (w  is  odd) 

1.3.5 n 

^1.3.5 {n^l)7i  .     .            - 

an  =  Z  ^    .     2^«-i  (w  IS  even) 


(256) 


or  in  general 

2n 
On^  CPn-2  (257) 


n 


Since  Oi  and  a2,  however,  have  the  evident  values  of  2  and  n  respectively, 
we  may  at  once  write  for  the  volume  of  our  hypersphere  the  expressions 


n  -\-  \  n  —  1 

Vnir)  =  r"  (w  is  odd) 

(258) 

n 

TT         /         X  (27l)2  .  .  . 

Vnir)  =  o    .\.    r"  (wis  even) 

2.4.6 n 

or  in  terms  of  the  Gamma  function  the  volume  of  an  w-dimensional  hyper- 
sphere of  radius  r  can  be  expressed  in  the  general  form 


r(|  +  i)'      r(|  +  i) 


F„(r)  =     ^,   ^^'  \  r«  =      ,    """     .    r"  (259) 


See  B.  O.  Pierce,  "Short  Table  of  Integrals,"  No.  483. 


130  STATISTICAL  MECHANICS  [Ch.  12 

159.    Just  as  we  have  regarded  the  surface  bounded  by  the  equation 

as  a  hypersphere  we  may  regard  the  surface  bounded  by  the  equation 

^  +  %  +  ....+^  +  %  =  ^^  (260) 

as  a  hypereUipsoid  with  semi-axes  aib,  a-2fb,  etc. 

The  vohime  of  the  hypereUipsoid  can  easily  be  found  by  making  the 
substitutions 

r,  =  ^    ,    ^.,r=:£^ (261) 

a,         "        (to 


to  have  the  value: 


V  =  — 7  ^ V-  b"aia2 an  (262) 


r 


a-') 


These  formulae  will  be  needed  for  our  treatment  of  the  vapor  pres- 
sure of  crystals  and  glasses,  and  of  chemical  equilibria  in  gases. 


Chapter  13 

The  Vapor  Pressure  of  Crystals  and  Glasses  at  Very 
Low  Temperatures 

160.  Introduction. — We  shall  first  apply  the  considerations  of  the 
preceding  chapter  on  the  general  determination  of  the  most  probable 
state  of  a  system,  to  investigate  the  most  probable  distribution  of  mole- 
cules between  a  condensed  phase  and  a  vapor  phase  at  very  low  tempera- 
tures. We  shall  take  the  temperature  so  low  that  all  the  degrees  of 
freedom  of  the  atoms  are  frozen  in  except  the  three  degrees  of  trans- 
lational  freedom  for  the  vapor  molecules.  This  means  that  the  condensed 
phase,  crystal  or  glass,  regarded  as  a  quantum  system  will  be  in  the 
lowest  possible  quantum  state,  and  the  molecules  of  the  vapor  will  behave 
like  those  of  a  monatomic  gas. 

The  investigation  is  of  interest  because  of  the  bearing  of  the  results 
on  the  third  law  of  thermodynamics,  which  has  been  much  employed  in 
the  calculation  of  chemical  equilibria  and  the  calculation  of  the  free 
energy  values  of  chemical  substances.  We  shall  find  that  a  given  sub- 
stance must  be  regarded  as  having  the  same  entropy  when  present  at 
the  absolute  zero  in  different  crystalline  forms.  On  the  other  hand  a 
substance  at  the  absolute  zero  in  the  form  of  a  supercooled  liquid  or  glass 
may  have  a  greater  entropy  than  in  the  crystalline  form  in  agreement 
with  the  ideas  of  Lewis  and  Gibson.  We  shall  also  find  that  the  entropy 
of  a  monatomic  gas  or  of  a  polyatomic  gas  at  temperatures  low  enough 
so  that  it  behaves  as  a  monatomic  gas,  will  be  given  by  the  Sackur 
formula  with  the  Tetrode  value  for  the  constant. 

161.  The  y-weight  of  Vapor  and  Crystal  at  Very  Low  Tempera- 
ture.— Consider  a  system  composed  of  X  -\-  Y  -\-  Z  .  . .  .atoms  of  differ- 
ent sorts  contained  in  a  volume  V,  and  combined  into  similar  molecules 
of  mass  M,  and  a  symmetry  number  o.  Let  A''  of  these  molecules  be  in 
the  form  of  a  vapor,  and  the  remainder  A^',  in  the  form  of  a  perfect 
crystal.  And  let  the  temperature  be  so  low  that  all  the  degrees  of  free- 
dom except  the  3^^  translational  degrees  of  freedom  for  the  vapor  mole- 
cules are  frozen  in,  these  latter,  however,  being  regarded  as  fully  excited. 

In  accordance  with  the  considerations  of  the  preceding  chapter  it  is 

131 
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evident  that  we  may  write  for  the  7-weight  of  this  state  of  the  system 
the  expression 

\X     \Y    \Z  3{X+V  +  Z  +  ....  — N) 

where  the  last  factor  y^  is  the  weight  of  the  6A'^  dimensional  surface  in  the 
y-space  that  would  be  generated  by  moving  our  representative  point  so  that 
the  N  vapor  molecules  assume  all  positions  and  velocities  consonant  with  the 
volume  of  the  vessel  and  energy  available,  the  factor  h^^^  +  y  +  z  +  . .  .~n) 
arises  from  the  weight  that  must  be  assigned  to  the  frozen  in  degrees 
of  freedom,  and  the  factor  {\X  \Y  \Z  . . .  .)/\N  0^  is  the  number  of 
permutations  of  atoms  that  will  lead  to  the  generation  of  similar  surfaces 
of  weight  Yjf . 

162.  The  value  of  the  weight  y^  which  arises  from  the  possibility  of 
assigning  different  positions  and  velocities  to  the  vapor  molecules  will 
evidently  be  given  by  the  multiple  integral 

yN=  j J  dx^dyidzi dx^dyNd^N  M^^dx\dyidzi dx^dyt/dsif     (264) 

where  the  limits  of  integration  must  be  taken  consonant  with  the  volume 
and  energy  available. 

We  may  at  once  introduce  the  limits  imposed  on  the  positional  co- 
ordinates by  the  fact  that  the  total  volume  available  is  V,  and  hence  can 
immediately  rewrite  equation  (264)  in  the  form 

Y^  =  VM  3^  f (dx^dy^dz^ dx^dyifdzN  (265) 

To  obtain  the  limits  to  be  imposed  on  the  velocities  we  may  proceed 
as  follows.  Let  K  be  the  kinetic  energy  of  the  N  vapor  molecules,  Nspot. 
their  potential  energy  and  A^'e'pot.  the  potential  energy  of  the  molecules 
of  the  crystal,  so  that  the  total  energy  E  of  the  system  is  given  by  the 
equation 

E  =  K  +  A^fipot.  +  A^'fi'pot.  (266) 

Hence  in  a  microcanonical  ensemble  of  systems  in  which  the  total  energy 
lies  between  E  and  E  -j-  dE,  it  is  evident  that  the  kinetic  energy 

K  =  E  —  Ne,ot.  —  N'e\ou  (267) 

must  lie  in  the  range 

dK  =  dE  (268) 

The  kinetic  energy,  however,  is  connected  with  the  velocities  of  the 
molecules  by  the  equation 
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^x'  +  y^'+^i' +■■■.  +  ^-n'  +  Vn'  +  ^n'  =-^  (269) 

and  this  is  the  equation  of  a  2N  dimensional  hypersphere  of  radius 
{2K/M')^  which  in  accordance  with  equation  (259)  of  the  last  chapter 
has  the  volume 

37V  IN_ 

^'A\v)\^JJ^^\u)'  (270) 

Evidently  then  the  representative  point  for  the  velocities  must  lie  in 
the  spherical  shell,  between  the  radii  ^2K/M  and  ^2{K-\-  dK)/M  and 
this  volume  is  seen  by  differentiating  the  expression  in  equation  (270) 
to  have  the  value 


2N  3N 


j         j  dxjyjz^  ....  dx^dy^dz^  = j^ \\~m)  ^^'^dE.   (271) 

Substituting  in  equation  (265)  and  then  substituting  the  value  thus 
obtained  for  7;^  in  equation  (263),  we  obtain  as  our  final  expression  for 
the  y- weight  of  the  system  of  crystal  and  vapor  the  result 

163.  Most  Probable  Distribution  between  Crystal  and  Vapor. — 
The  above  expression  for  the  y-weight  is  obviously  dependent  on  the 
number  of  molecules  A^  in  the  vapor,  and  hence  we  can  determine  the 
most  probable  distribution  of  molecules  between  crystal  and  vapor  by 
putting  the  variation  of  y  equal  to  zero.  Using  for  greater  convenience 
the  logarithm  of  y,  introducing  the  Stirling  approximation  for  factorials 

\N=V2^(^Y  (273) 

r(f  +  l)  =  |(f)  (274) 


putting 


and  omitting  terms  which  are  negligible  when  A^  is  large,  we  easily  obtain 
from  equation  (272)  the  expression, 
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d\og  y  =  dNl  —  log  A^  —  log  0  —  3  log  /z  +  log  F  +  |  log  InMK 

3,      2>N    ,    ZNd\ogK\        „      ,_,. 

as  the  condition  for  the  most  probable  distribution  of  molecules  between 
crystal  and  vapor. 

164.  Vapor  Pressure  of  the  Crystal. — Introducing  the  relation 

dK=  (fi'pot.  — «Pot.)<5iV  (276) 

which  is  an  obvious  result  of  equation  (267),  giving  the  kinetic  energy 
of  the  vapor  molecules  the  value  of  which  it  has  at  the  low  concentra- 
tions involved 

K=^NkT  (277) 

and  putting  for  the  pressure  of  the  vapor 

P  =  ykT  (278) 

the  condition  for  maximum  probability  easily  reduces  to 

log  P  ^  -  '"^-  ~ '^""  +  f  log  feT  + 1  log  ^  -  log  o    (279) 

which  is  the  desired  expression  for  the  vapor  pressure  of  a  crystal  at  low 
enough  temperatures  so  that  the  vapor  acts  as  a  monatomic  gas  with  its  3A^ 
translational  degrees  of  freedom  fully  excited,  and  all  other  degrees  of 
freedom  frozen  in,  in  the  lowest  quantum  state. 

165.  Vapor  Pressure  o£  a  Glass. — Similar  methods  to  the  above  can 
be  used  to  obtain  an  expression  for  the  vapor  pressure  of  a  glass  or 
supercooled  liquid  at  very  low  temperatures.  In  the  case  of  a  glass, 
however,  an  exceedingly  important  difference  arises  when  we  consider 
the  number  of  ways  of  arranging  the  molecules  which  make  up  the  con- 
densed phase. 

In  a  perfect  crystal  having  a  given  size,  shape,  and  orientation  each 
molecule  occupies  a  definite  equilibrium  position  about  which  it  will 
oscillate  when  the  temperature  is  raised  and  the  total  number  of  differ- 
ent exemplars  of  the  crystal  which  can  be  obtained  will  be  found  by 
permutations  of  the  atoms  which  do  not  involve  changes  in  equilibrium 
positions.  In  the  case  of  a  supercooled  glass,  however,  especially  one 
composed  of  complicated  molecules,  there  may  be  a  number  of  ways  of 
fitting  the  molecules  together  and  obtaining  an  exemplar  of  the  glass,  and 
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we  must  take  account  of  this  in  determining  the  y-weight  for  a  given 
distribution  of  molecules  between  the  condensed  and  vapor  phases. 

In  determining  the  number  of  such  arrangements  of  the  molecules 
of  the  glass  which  involve  a  shift  in  the  positions  of  the  molecules,  we 
must  proceed,  however,  with  care.  We  are  applying  our  considerations 
to  a  system  which  has  all  of  its  degrees  of  freedom  frozen  in,  and  are 

assigning  in  any  case  a   weight   equal   to  the  area  ^'  =  I  pdq  to  each 

degree  of  freedom.  Hence  a  shift  in  the  position  of  a  molecule  which 
does  not  move  it  out  of  a  region  in  the  phase  space  already  allotted  to 
it  as  belonging  to  the  lowest  quantum  state  must  not  be  counted.  To 
determine  the  correct  number  of  shifts,  let  us  consider  the  properties 
of  the  glass  when  the  degrees  of  freedom  which  locate  the  positions  of 
the  molecules  have  been  excited  to  the  next  to  the  lowest  quantum  state. 
Since  in  accordance  with  the  considerations  of  Chapter  8  we  should 
assign  to  the  lowest  or  frozen-in  quantum  state  all  the  y-weight  corre- 
sponding to  systems  in  between  the  lowest  and  next  to  the  lowest  states, 
it  is  evident  that  rearrangements,  involving  shifts  in  the  equilibrium  posi- 
tions of  the  molecules,  when  the  systerti  is  thus  excited,  should  all  of 
them  be  counted  as  giving  new  exemplars  of  the  glass.  It  is  evident  that 
the  number  of  such  exemplars  of  the  glass  will  depend  on  the  number  of 
molecules  of  A'''  in  the  glass  so  that  w^e  may  represent  the  number  of 
exemplars  of  glass  by  the  symbol  4>{N'). 

166.  We  can  now  obtain  an  equation  for  the  vapor  pressure  of  a 
glass  in  the  same  way  that  we  derived  the  vapor  pressure  of  a  ci-ystal.  The 
expression  for  the  7-weight  of  a  given  distribution  of  molecules  between 
the  condensed  and  vapor  phases  will  evidently  have  the  same  form  as 
equation  (272)  except  that  it  will  be  multipHed  by  the  number  of 
exemplars  of  glass  cf)(N'),  and  this  leads  to  a  vapor  pressure  formula 
exactly    similar   to    (279)    except    that    it    contains    the    additional    term 

-,'r}. as  shown  below. 

•''^^- kf +  2^ogkT  +  -^\og-j^-\ogo j^, (280) 

The  nature  of  the  function  <f>(N')  can  now,  however,  easily  be  deter- 
mined. Since  we  assume  that  we  have  taken  a  large  enough  sample 
of  glass  so  that  the  vapor  pressure  is  not  appreciably  dependent  on  the 
number  of  molecules  A^'  in  the  glass,  it  is  evident  that  the  last  term  in 
equation  (280)  must  be  a  constant.     Hence  we  may  write 
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—^r -log  a  (281) 

where  o  is  a  constant  and  by  integration  obtain 

log  4>{N')  =  N'  log  a  +  const.  cf>(N')  =  ca-N'         (282) 

where  c  is  a  constant.  Furthermore,  substituting  (281)  in  (280)  we 
obtain 

log  P  ^  -  "^"-  -/'^'  +  I  log  feT  +  I  log  Jg^  -  log  a  -  log  a    (283) 

167.  This  development,  however,  also  provides  some  information  as 
to  the  magnitude  that  the  quantity  a  can  be  expected  to  assume.  Equa- 
tion (282)  gives  us  an  expression  for  the  number  of  ways  we  can  re- 
arrange the  A'^'  molecules  of  a  glass  and  still  obtain  an  exemplar  of  the 
glass.  If  we  increase  the  number  of  molecules  by  d,  the  number  of 
rearrangements  becomes 

4>{N'  ^d)  +5)  =ca^'  +  ^ 

=  a^<l,(N^)  (284) 

so  that  a*  is  the  factor  by  which  the  number  of  arrangements  is  in- 
creased when  we  increase  the  number  of  molecules  by  <3.  We  see  then 
that  a  may  be  regarded  as  the  average  number  of  ways  in  which  a  mole- 
cule can  be  rearranged  in  the  glass,  excluding  rearrangements  which  can 
be  obtained  by  permutations  of  the  atoms  and  excluding  rearrangements 
which  would  disappear  when  the  positional  degrees  of  freedom  for  the 
molecules  of  the  glass  have  been  excited  to  the  next  to  lowest  quantum 
state. 

We  see  that  on  this  basis  a  will  presumably  be  a  small  number  which 
increases  with  the  complexity  of  the  molecule.  This  conclusion  will  be 
of  interest  in  connection  with  a  consideration  of  the  entropy  of  liquids. 

168.  Applications  in  Thermodynamics. — The  above  conclusions  as 
to  the  vapor  pressure  of  crystals  and  glasses  at  very  low  temperatures 
are  of  great  interest  in  connection  with  the  postulates  made  by  the  so- 
called  third  law  of  thermodynamics  as  to  the  entropy  of  substances  in 
the  neighborhood  of  the  absolute  zero.  Indeed  they  permit  us  to  obtain 
a  very  precise  notion  of  the  implications  of  the  third  law. 

a.  Entropy  of  Different  Crystalline  Forms  of  the  Same  Substance. — 
In  the  first  place  it  can  be  shown  from  our  formula  for  the  vapor  pres- 
sure of  crystals,  equation  (279),  that  the  different  crystalline  forms 
of  the  same  substance  will  have  the  same  entropy  at  very  low  tempera- 
tures.   To  prove  this  consider  two  crystalline  forms  1  and  2  of  the  same 
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substance  and  transform  one  mol  of  the  former  into  the  latter  by  a 
reversible  evaporation,  followed  by  a  reversible  change  in  pressure  and  a 
reversible  condensation  at  the  new  vapor  pressure.  The  entropy  change 
accompanying  this  process  will  evidently  be 

AS  =^^p-+R  log  ^  -  ^^  (285) 

where  AHx  and  AH2  are  the  reversible  heats  of  evaporation  of  the  two 
crystals  and  Pi  and  P2  are  their  vapor  pressures.  We  may  evidently 
substitute,  however, 

T  T     ~  ^'  f  ^^ 


^"'       ''"'-  =  N^ ..^^-W 


T 


(286) 


where  N  is  Avogadro's  number  and  e/  and  go'  are  the  potential  energies 
of  the  molecules  in  the  two  crystalline  forms,  and  e  in  the  gaseous  form, 
and  it  will  be  seen  from  equation  (279)  that  this  quantity  will  just  cancel 
the  term  containing  the  ratio  of  the  vapor  pressures  R  log  (F1/F2)  leading 
to  the  desired  result. 

AS=^0  '  (287) 

b.  Entropy  of  Crystalline  Compounds  and  Elements. — It  is  also  easy 
to  see  that  there  will  be  no  change  in  entropy  at  very  low  temperatures 
if  we  transform  a  crystalline  compound  into  its  elements  provided  they 
are  also  in  the  crystalline  form.  We  can  think  of  the  transformation 
as  carried  out  by  a  reversible  evaporation  of  the  compound,  followed 
by  a  reversible  change  in  the  pressure  of  the  vapor  of  the  compound  to 
that  pressure  which  it  has  in  equilibrium  in  the  vapors  from  the  crystals 
of  the  elements,  followed  by  a  reversible  condensation  into  the  crystals 
of  the  element.  It  is  evident  that  a  formula  can  be  derived  for  the  vapor 
pressure  of  the  compound  in  equiHbrium  with  the  vapors  from  the  crystals 
of  its  elements  by  exactly  the  same  methods  as  equation  (279)  for  the 
vapor  pressure  over  a  single  crystal,  and  indeed  the  formula  will  have 
the  same  form  as  (279)  except  that  the  quantity  e'pot.  will  be  the  sum  of 
the  potential  energies  in  the  crystals  of  the  elements  of  the  atoms  formed 
from  one  molecule  of  the  compound.  Using  this  result  we  can  obviously 
proceed  as  in  the  last  paragraph  and  show  here  too  that 

AS  =  0  (288) 

for  the  transformation. 

c.  Entropy  of  Glasses  at  Low  Temperatures. — It  is  also  important 
to  note  that  our  two  formulse  (279)  and  (283)  for  the  vapor  pressures  of 
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crystals  and  glasses  lead  at  once  by  the  method  used  above  to  the  con- 
clusion that  the  entropy  of  a  supercooled  glass  at  very  low  temperatures 
is  greater  than  that  of  the  same  material  in  the  form  of  a  pure  crystal  by 
an  amount 

AS  =  R  log  a  (289) 

Since  we  have  seen  above  that  a  is  apparently  a  small  number  increas- 
ing with  the  complexity  of  the  molecule,  we  can  form  at  once  an  idea 
as  to  the  order  of  magnitude  of  the  differences  we  can  expect  between 
the  entropy  of  crystals  and  glasses.  These  results  are  in  agreement 
with  the  more  qualitative  ideas  of  Lewis  and  Gibson  ^  and  the  experi- 
mental work  of  Gibson  and  Giauque  ^  on  the  entropy  of  glasses  and  crystals 
of  glycerine.^ 

d.  The  Entropy  of  Monatomic  Gases. — Equation  (279)  for  the  vapor 
pressure  of  crystals  at  low  temperatures  provides  the  most  satisfactory 
method  of  deriving  the  Sackur-Tetrode  equation  for  the  entropy  of 
monatomic  gases.  The  change  in  entropy  in  changing  one  mol  of  sub- 
stance at  low  temperature  from  crystal  to  vapor  at  pressure  P  will  evi- 
dently be 

AS  =  ^  +  R\og^  (290) 

where  AH  is  the  reversible  heat  of  evaporation  and  Pvap.  is  the  vapor 
pressure.     Introducing  the  obvious  relation 

AH  =  N  fe  —  e'  -\-^kT\  (291) 

and  substituting  equation  (279)   for  the  vapor  pressure  we  obtain 

AS  =  ^R  log  T  -  R  log  P  -^^  Rlog  M  +  R  log^  ^^^3       -]-^R  —  Rlogo      (292) 

If  we  take  the  entropy  of  the  crystal  itself  as  zero  as  is  customary,  the 
above  expression  is  the  value  for  the  entropy  of  the  gas  itself.  For  gases 
which  actually  are  monoatomic  0  is  unity  and  the  expression  reduces  to 
that  of  Sackur  with  the  Tetrode  value  for  the  constant.  There  seems, 
however,  no  reason  why  the  Sackur-Tetrode  equation  is  not  also  valid  for 
gases  in  which  0  is  not  unity  provided  we  are  at  low  enough  tempera- 
tures so  that  rotations  of  the  molecules  are  frozen  in.  (See  Chapter  9, 
Section  130.) 

^  Lewis  and  Gibson,  Jour.  Amer.  Chem.  Soc,  42,  1529  (1920). 
'Gibson  and  Giauque,  Jonrn.  Amer.  Chcm.  Soc,  45,  93  (1923). 
'The  above  treatment  of   the   vapor   pressure  and  entropy  of   glasses   is   due  to 
Pauling  and  Tolman,  Journ.  Amer.  Chcm.  Soc,  47,  2148  (1925). 
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The  foregoing  results  contain  the  full  import  of  the  third  law  of 
thermodynamics  as  far  as  concerns  the  usual  calculations  of  entropy  and 
free  energy  values. 

The  treatment  given  in  this  chapter  has  been  based  on  the  assumption 
that  each  frozen  in  degree  of  freedom  contributes  the  factor  h  to  the 
total  y-weight  of  a  state  of  the  system.  The  possibility  has  recently  been 
suggested,  however,  that  certain  degrees  of  freedom  which  have  been  so 
treated  may  contribute  a  somewhat  larger  factor  to  the  y-weight.  If  this 
suspicion  should  prove  to  be  true,  the  above  formulae  would  be  subject  to 
small  modifications. 

For  example,  if  the  lowest  rotational  state  of  a  vapor  molecule  should 
have  the  a-priori  probability  po,  conditions  in  the  crystal  remaining  as 
assumed,  the  Sackur  Tetrode  formula  (292)  would  contain  an  additional 
term  -|-  R  log  po.  Since  po,  if  not  unity,  would  presumably  be  a  small 
integer,  it  is  doubtful  if  our  present  entropy  data  are  accurate  enough  to 
establish  the  presence  of  such  a  term. 


Chapter  14 

Gaseous  Equilibria  at  Temperatures  of  Constant 
Specific  Heat 

169.  Introduction. — The  investigations  of  the  last  chapter  have  given 
us  values  for  the  entropy  of  crystals  and  gases  at  the  absolute  zero,  and 
hence  make  possible  the  calculation  of  the  entropy  change  accompanying 
a  gaseous  reaction  at  higher  temperatures,  by  the  consideration  of  a 
cycle  in  which  we  start  with  the  initial  substances  at  the  absolute  zero, 
vaporize,  raise  to  the  higher  temperature,  permit  the  reaction  to  occur, 
cool  the  product  to  the  absolute  zero,  and  then  condense  to  crystalline 
form.  Combining  the  entropy  change,  thus  obtained  for  a  gaseous  re- 
action, with  the  heat  change  we  can  then  calculate  the  free  energy  change 
and  equiHbrium  constant  of  the  reaction. 

Such  a  thermodynamic  method  of  calculation  has  the  advantage  of 
being  familiar  and  of  permitting  the  use  of  actual  empirical  rather  than 
theoretical  formulae  for  the  specific  heats  of  the  gases  involved  when  the 
temperature  is  raised.  Nevertheless,  it  is  evident  that  the  methods  of 
Chapter  12  should  make  it  possible  to  obtain  a  direct  statistical  mechanical 
calculation  of  gaseous  equilibria  by  determining  what  state  of  chemical 
combination  has  the  greatest  probability  or  y-weight.  Such  a  statistical 
mechanical  method  is  theoretically  illuminating  and  often  has  practical 
advantages. 

The  treatment  given  below  is  substantially  that  of  Ehrenfest  and 
Trkal.i 

170.  The  System  to  be  Considered. — Let  us  consider  a  gaseous 
system  of  volume  V  and  temperature  T,  composed  of  X,  Y,  Z,  etc.,  atoms 

of  different  kinds,  combined  into  A^'i  A^2 ^ i  molecules  of  different 

kinds.  And  let  us  consider  that  the  temperature  T  is  such  that  all  the 
degrees  of  freedom  can  be  considered  as  either  fully  excited  or  fully 
frozen  in.  Thus  a  monatomic  molecule  will  be  taken  as  having  3  degrees 
of  freedom,  corresponding  to  its  translational  motion,  a  diatomic  mole- 
cule as  having  5  degrees  of  freedom  corresponding  to  translation  and 
rotation  about  two  axes,  and  a  polyatomic  molecule  as  having  6  degrees 

*  Ehrenfest  and  Trkal,  Proc.  Amsterdam  Acad.,  23,  162  (1920). 
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of  freedom  corresponding  to  translation  and  rotation  about  all  three 
axes.  With  these  assumptions  we  shall  evidently  have  to  be  in  a  tem- 
perature range  of  constant  specific  heats  Cv  =  S/2R,  5/2R  and  3R  corre- 
sponding respectively  to  monatomic,  diatomic  and  polyatomic  molecules. 
Vibrational  degrees  of  freedom  will  be  taken  as  completely  frozen-in. 

We  shall  use  the  symbols  M^,  P^,  Q^,  R^,....  Mj,  Pj,  Qj,  Rj  for  the 
masses  and  moments  of  inertia  of  the  different  kinds  of  molecules. 
For  monatomic  molecules  all  three  moments  of  inertia  P,  Q,  R  will  be 
equal  to  zero.  For  diatomic  molecules  we  shall  have  P  =  Q,  R=zO; 
and  for  polyatomic  molecules  all  three  moments  will  in  general  be 
different. 

For  the  total  number  of  degrees  of  freedom  we  shall  evidently  have 
3(X  -{-  Y  -\-  Z  -{-....) .  Some  of  these  degrees  of  freedom  will  be  fully 
excited  and  some  frozen-in.  For  the  number  of  fully  excited  degrees  of 
freedom  we  shall  use  the  symbol  F  and  shall  write 

F  =  ^Nifi  fi  =  S,5  or  6.  (293) 

» 

where  Ni  is  the  number  of  molecules  of  the  i'th  kind  and  fi  has  one  of 
the  values  indicated  depending  on  whether  the  molecule  is  monatomic, 
diatomic  or  polyatomic.  For  the  number  degrees  of  freedom  which  are 
frozen  in,  we  shall  evidently  have  3{X  -\-  Y  -\-  Z  -\-  ...  .)  —  F. 

For  the  symmetry  factors  for  the  different  kinds  of  molecules  we  shall 
use  the  symbols  Oi,  02, .  . .  .  Oj. 

171.  Probability  of  a  Given  State  of  Combination. — In  accordance 
with  the  principles  of  Chapter  12,  we  can  now  evidently  write  for  the 
y-weight  of  a  given  state  of  chemical  combination  of  the  atoms  in  the  gas 

\X  \Y  \Z 

'       '       '-  ^,ziX-\-Y  +  z  +  ...)-Fy^      (294) 


liVi  \N^....  \Ni  or'^^02^=  ...Oi 


N, 


where  yi  is  the  area  of  the  hyper-surface  in  the  /-space  corresponding  to 
fully  excited  degrees  of  freedom  over  which  the  representative  point  for 
the  system  can  be  moved,  without  changing  the  state  of  chemical  com- 
bination and  without  permuting  the  atoms. 

172.  The  value  of  y^  can,  however,  easily  be  seen  to  be  given  by 
the  multiple  integral  for  the  coordinates  and  momenta  q^ . .  .  .  p  p  for  the 
fully  excited  degrees  of  freedom 

y,=^....^dq^....dp^  (295) 

provided  we  use  the  proper  limits  of  integration.    In  the  case  of  the  three 
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positional  coordinates  for  each  molecule  we  can  evidently  integrate  over 
the  whole  volume  V  and  this  will  introduce  the  factor  V(.Ni  +  N2  +  . . ..  +  Nj) 
In  the  case  of  the  two  angular  coordinates,  which  determine  the  direc- 
tion of  the  axis  of  a  diatomic  molecule,  we  must  integrate  over  the  sur- 
face of  the  whole  solid  angle  47i  and  shall  hence  obtain  the  further  factor 
(An)^'"  where  Na  is  the  number  of  diatomic  molecules  and,  in  the  case 
of  the  three  angular  coordinates  necessary  to  determine  the  orientation 
of  the  axes  of  the  polyatomic  molecules,  will  evidently  obtain  on  integra- 
tion the  factor  {Sti^)^"  when  Np  is  the  number  of  polyatomic  molecules. 
Hence  integrating  first  of  all  the  coordinates,  we  can  rewrite  equa- 
tion (295)  in  the  form 

y^—yiN,  +  N,  +  ...  +  Ni)  (47i)^<'i87i^)^^  f...  Cdp^-.-dp^^    (296) 

We  must  next  inquire  into  the  limits  of  integration  for  the  momenta 
pi ... .  pp.  The  values  of  these  momenta  will  evidently  determine  the 
kinetic  energy  of  the  system  in  accordance  with  the  equation 

^  =  2^  +  H:  +  ----  +  It,  (297) 

where  the  constants  A^.  A2 . .  .  .  Af  are  evidently  the  masses  and  moments 
of  inertia  MiPiQiRi ....  MjPjQjRj,  and  the  F-dimensional  volume  in 
which  the  phase-point  would  have  to  be  for  any  value  of  the  kinetic 
energy  less  than  K  would  evidently  by  a  hyper-ellipsoid,  having  a  volume 
in  accordance  with  Section  159  of  Chapter  12,  of  the  amount 

y^  =  4^^^  Vj,rU ^.  (298) 

We  are  interested,  however,  in  a  microcanonical  ensemble  of  systems 
which  have  total  energies  between  E  and  E  -{-  dE,  and  since  the  potential 
energy  of  the  systems  is  definitely  fixed  by  the  state  of  chemical  combina- 
tion, it  is  evident  that  the  kinetic  energy  must  be  in  the  range 

d^  =  dE  (299)  . 

Hence  by  differentiating  equation  (298)  with  respect  to  K  we  obtain 
for  the  volume  of  the  ellipsoidal  shell  of  interest 

F     VC^TiK)"       , 

dVp  =  y  —y^p — r-  VA,A.....Af  K-^dE  (300) 

^(1  +  0 
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Substituting  in  (296),  we  then  obtain 


y,  =  V^^^  (4;i)'^-^  (Sn-r"^    ^,^1''^^\VA,A,  ...A,   K'^dE        (301) 


This  expression  can,  however,  be  somewhat  simpHfied  if  we  note  that 
the  constants  A^  .  . .  .  Af  are  evidently  in  accordance  with  equation  (297) 
the  masses  and  moments  of  inertia  of  the  molecules.  The  mass  for  each 
molecule  enters  three  times  into  the  product  corresponding  to  three  com- 
ponents of  momentum  and  each  moment  of  inertia  enters  once.  Hence  if 
we  put 

=  VMi^  (for  monatomic  molecules) 

=  4n  y/'Mi^Pi"  (for  diatomic  molecules)         (302) 

=  ^71-  y/Mi^PiQiRi  (for  triatomic  molecules) 

We  can  rewrite  equation  (301)  in  the  form 

^■  =  ^-"'2^/F^S(?°'    )g-'-^£  (303) 

where  Hi  indicates  a  product  taken  for  all  the  kinds  of  molecules  present. 

173.  Substituting  now  in  equation  (294),  and  taking  for  convenience 
the  logarithm,  we  obtain  as  an  expression  for  the  total  /-weight  of  a 
given  state  of  chemical  combination 

log  7  =  log  ( [X   |F   |Z  .  .  .)/i3(A'  +  7  +  z  +  . . .)  rf^ 

—  2  log  |i^  —  SiVi  log  Oi  —  F log  h 

i  i 

+  SiVi  log  F  +  log  I  -f  1^  log  27iK 

-logr(^|  +  lj+2iViloga,-logi^  ^3^4^ 

where  for  convenience  constant  quantities  have  been  collected  in  the  first 
term. 

This  expression,  however,  can  be  written  in  a  simpler  form,  if  we 
denote  the  constant  first  term  by  the  symbol  /,  introduce  equation  (293) 
for  the  number  of  fully  excited  degrees  of  freedom 

F  =  Y.Nifi 
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and  introduce  the  approximations 

log  |A^  =  A^  (log  A^— 1) 


log  ^  (f  +  1 )  -  log  I ^/^  -  f  ( log  f  -  1 ) 


(305) 


We  obtain 

log  7  =  /  —  ^Ni  (log  A^i  —  1 )  —  ^Ni  log  Oi  —  ^Nifi  log  h 

i  »  » 

+  ^Ni  log  F  +  log  1^  +  ^Nifi  log  V2^ 
—  |(log|  — A  +2iVaogai-log/v 

or  collecting  terms  and  introducing  a  new  quantity  ai,  which  is  a  property 
of  molecules  of  the  fth  kind,  defined  by  the  equation 

log  ai=  —  log  Oi  —  fi  log  h-\-  fi  log  ^/'Zn  -\-  log  ai 
or  (306) 

"^  =  ^  v^r; 

We  can  write  as  our  final  expression  for  the  total  y-weight  of  any  given 
state  of  chemical  combination 

\ogy  =  I  —  ^Ni{\ogNi-\)-^  ^log|_l^ 

+  ^Ni  Aog  F  +  ^  log  i^  +  log  a\  (307) 

where    the    two    terms    logF/2    and    — logi^    have    been    dropped    as 
negligible  compared  with  F/2  log  F/2  and  F/2  log  K. 

174.  Change  in  Probability  with  Chemical  Reaction. — If  a  chem- 
ical reaction  takes  place  between  the  molecules  of  the  gas  the  probability 
of  the  state  as  given  by  equation  (307)  will  in  general  be  altered,  since 
the  number  of  molecules  Ni  in  the  different  states  will  be  altered.  For  the 
change  in  the  log  y  accompanying  a  reaction  we  can  then  write 

d\ogy  =  —  ^dNi  log  Ni  —  ^  log| 

+  ^dNi  (logV  +  ^logK  +  log  ai\ 

+  llNi  ^^  (308) 
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It  is  evident  from  equation  (293),  however,  that  we  can  put 

dF  =  ^SNifi  (309) 

i 

Furthermore,  if  we  denote  the  potential  energy  of  a  molecule  of  the  i'th 
kind  by  £i  we  can  write 

K  =  E  —  ^Niei  (310) 

» 
and  put 

dK  =  —  ^dNiei  (311) 

i 

Finally  let  us  specify  the  chemical  reaction,  which  we  are  considering 
as  producing  the  change  in  composition,  by  stating  that  Vi  is  the  num- 
ber of  molecules  of  the  /th  kind  that  are  produced  when  the  reaction 
as  written  takes  place.  Thus  if  we  write  the  reaction  in  the  familiar 
form  aA  -\-  bB  -{-...  =  cC  -}-  dD  -\-  ..  . 

the  quantities  Vi  are  the  numbers  a,  b,.  .  .  .c,  d.  . .  .,  taken  with  the  proper 
sum,  which  give  the  number  of  molecules  entering  into  the  chemical  re- 
action.   We  may  then  write 

dNi  =  VidQ  (312) 

if  we  denote  by  SQ  the  extent  to  which  the  reaction  occurs.  Substituting 
equations  (309),  (311)  and  (312)  in  (308),  we  obtain 

^  log  7  1=  ^Q  <  —  2n  log  Ni  —  ^Vi  ^  log  -X- 

-f  2n  Aog  V-\-^\ogK  +  log  a\ 

r  (313) 


(!¥)^ 


K   J 

for  the  change  in  probability  with  chemical  reaction. 

175.  Condition  of  Chemical  Equilibrium. —  The  condition  for 
chemical  equilibrium  with  respect  to  the  reaction  in  question  is  evidently 
obtained  by  setting  expression  (313)  equal  to  zero,  and  thus  securing 
the  chemical  state  of  maximum  probability.  Doing  this,  rearranging  and 
noting  F  =  1.Nifi,  we  obtain 

2n  log  ^  =  -  ^  2ne,  -f-  2n  ^^og^-h  2r,  log  a,     (314) 

The  quantity  on  the  left  hand  side  of  this  equation  is,  however,  evi- 
dently equal  to  log  /v\.,  where  Kc  is  the  familiar  equilibrium  constant  for 
the  reaction,  expressed  with  concentrations  in  numbers  of  molecules  per 
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unit  volume.  Furthermore  we  can  evidently  write  from  the  principle  of 
equipartition  of  energy  which  holds  for  the  fully  excited  degrees  of 
freedom 


F 


(315) 


Substituting   in   equation    (314),    we   obtain   as   our    final   expression 
for  the  eqidlibriuin  constant  of  the  reaction 


1  fi 

log  Kc  =  —  -7-^  ^Vifii  +  2rf  ^  log  kT  +  ^Vi  log  at 


(316) 


In  applying  this  equation  it  should  be  noted  that  Kc  is  the  equilibrium 
constant  expressed  with  concentration  in  numbers  of  molecules  per 
cubic  centimeter,  and  that  HviSi  is  the  change  in  potential  energy  and 
kTI^Vifi/2  is  the  change  in  kinetic  energy  when  Vi  molecules  react.  Fur- 
thermore, in  accordance  with  equations  (302)  and  (306),  it  should  be 
noted  that  the  theoretical  "chemical  constants"  have  the  values 

-{-,,-)  (317) 

for  monatomic,  diatomic  and  polyatomic  molecules  respectively. 


=  ^vjr:^ 


=  --  VAU'P,Q,R] 

Oi 


176.  Some  interesting  consequences  of  equations  (316)  and  (317) 
may  be  noticed.  In  the  first  place  at  low  temperatures  the  preponderat- 
ing influence  of  the  first  term  throws  the  equilibrium  in  the  direction 
of  decreased  potential  energy,  while  at  high  temperatures  the  influence 
of  the  second  term  throws  the  equilibrium  in  the  direction  of  increased 
number  of  fully  excited  degrees  of  freedom,  that  is  in  the  direction  of 
dissociation. 

With  regard  to  the  "chemical  constants"  as  given  by  equation  (317), 
it  should  be  noted  of  course  that  transformations  must  be  made  in  order 
to  obtain  the  usual  values  of  the  chemical  constants  used  in  thermo- 
dynamic calculations.  It  should  also  be  noted  that  the  words  monatomic, 
diatomic,  and  polyatomic  are  used  in  connection  with  these  constants 
in  a  purely  conventional  sense  to  indicate  the  number  of  excited  degrees 
of  freedom.  Thus  hydrogen  (H2)  at  low  temperatures  would  be  regarded 
as  monatomic,  and  carbon  dioxide  (O  =  C  =:  O)  which  is  known  from 
spectral  data  to  have  a  linear  structure  would  be  regarded  as  diatomic. 
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The  interesting  effect  of  the  symmetry  factor  Oi  in  decreasing  the 
tendency  of  a  molecule  to  form  should  also  be  noted.  Thus  "other 
things  being  equal"  there  is  twice  the  tendency  of  a  molecule  of  the 
linear  structure  A  A  B  to  form  as  for  one  of  the  structure  ABA.  This 
can  be  seen  qualitatively  from  the  consideration  that  the  first  molecule 
could  form  by  the  two  steps  A  -\-  A  B  and  A  A  -\-  B,  but  the  second  only 
by  the  step  A  B  -\-  A. 

177.  Application  to  Thermal  Ionization. — As  an  important  ex- 
ample of  the  application  of  these  methods,  we  may  calculate  the  degree 
of  thermal  ionization  of  a  metal  vapor  into  positive  ions  and  electrons 
at  high  temperatures.  This  is  a  problem  of  great  interest  for  astro- 
physics.^ 

Consider  the  vapor  of  a  metal  A  which  can  exist  in  its  normal  un- 
ionized state  or  dissociated  into  a  positive  ion  and  electron  in  accordance 
with  the  reaction 

A  =  A^  +  E- 

We  can  immediately  apply  equation  (316)  to  this  reaction.  In  doing  so 
we  may  neglect  the  slight  differences  between  the  masses  and  moments 
of  inertia  of  A  and  A''  so  that  the  chemical  constants  for  these  two  con- 
stituents will  balance  out,  except  for  the  fact  that  it  would  seem  reason- 
able to  take  the  symmetry  factor  o  for  the  atom  A  as  equal  to  the  num- 
ber of  equivalent  electrons  held  in  the  most  lightly  bound  orbits,  and 
hence  susceptible  to  ionization.  Applying  equation  (316)  and  noting 
equations  (317),  with  the  modification  resulting  from  the  introduction  of 
a  symmetry  factor  into  the  expression  for  a  monatomic  molecule,  we 
evidently  obtain 

log       (^)  kT  ^  2   ^        ^^        h^ 

or 

(A) 1? '     '"^  ^^^^^ 

where  the  concentrations,  in  molecules  per  cubic  centimeter,  are  indicated 
by  enclosing  the  symbol  for  the  atom  in  brackets,  e^  is  the  work  of 
ionization,  m  is  the  mass  of  the  electron  and  o  is  the  symmetry  factor 
for  the  unionized  atom. 

We  shall  find  a  later  application  for  this  equation. 

'See  Fowler  and  Milne,  Monthly  Notices  R.  A.  S.,  83,  407  (1923). 


Chapter  15 

Some  General  Principles  Regarding  Molecular 
States  and  Molecular  Processes 

178.  Introduction. — In  our  preceding  statistical  mechanical  discus- 
sions we  have  been  mainly  interested  in  determining  the  numhers  of 
molecules  of  a  system  in  different  states  when  the  system  as  a  whole  is 
in  a  condition  of  equilibrium  and  have  paid  scant  attention  to  the  proc- 
esses by  which  molecules  change  from  one  state  to  another.  It  is  evi- 
dent, however,  that  even  in  a  condition  of  equilibrium,  molecules  will  be 
continually  changing  from  one  state  to  another,  the  constancy  of  the 
quotas  in  the  different  possible  states  being  maintained  by  some  form  of 
opposed  processes  which  balance  each  other.  Furthermore,  in  the  case 
of  systems  which  are  not  in  a  state  of  equilibrium,  where  gross  physical 
chemical  changes  are  taking  place,  it  is  evident  that  the  progress  of 
these  changes  is  intrinsically  dependent  on  the  mechanisms  by  which 
molecules  can  change  from  one  state  to  another.  Hence  it  can  easily  be 
seen  that  a  complete  understanding  of  the  processes,  by  which  systems 
approach  the  state  of  equilibrium  and  maintain  this  state  after  it  is 
reached,  is  dependent  upon  a  knowledge  of  the  nature  of  the  individual 
mechanisms  by  which  molecules  change  from  one  state  to  another. 

The  present  chapter  will  be  devoted  to  a  somewhat  abstract  classifica- 
tion of  molecular  states  and  molecular  processes  and  the  discussion  of 
some  very  general  principles  as  to  the  nature  of  molecular  processes.  The 
next  two  chapters  will  investigate,  from  this  special  point  of  view  of 
the  mechanism  of  molecular  processes,  two  important  cases  of  equi- 
librium, and  later  chapters  will  use  a  knowledge  of  the  laws  governing 
these  processes  to  investigate  the  rates  of  gross  physical  chemical  change. 

179.  The  Specification  o£  Molecular  States. — In  most  statistical 
mechanical  considerations  we  treat  systems  which  are  dilute  enough  so 
that  we  can  specify  the  states  of  the  individual  molecules  composing  the 
system  without  any  ambiguity  such  as  would  be  introduced  by  the  con- 
tinuous action  of  closely  neighboring  molecules.^ 

*The  necessity  of  sufficient  dilution  does  not  thereby  completely  rule  out  the 
treatment  of  solids,  liquids  and  compressed  gases  by  the  methods  under  discussion, 
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We  shall  denote  a  given  state  of  a  molecule  by  giving  the  values 
of  the  n  generalized  coordinates  (t/i  ....  q,,)  and  n  momenta  (  pi  ...  .  pn), 
which  determine  not  only  the  location  and  motion  of  the  center  of  gravity 
of  the  molecule,  but  also  the  positions  and  motions  of  its  internal  parts. 
And  in  order  to  assist  in  visualization  and  in  providing  a  convenient 
quasi-geometrical  language,  we  shall  use  our  earlier  device  of  represent- 
ing the  state  of  a  molecule  by  the  location  of  a  representative  point  in  a  2w 
dimensional  phase-space  (//-space)  corresponding  to  the  2n  coordinates 
and  momenta  of  the  molecule. 

In  general  the  state  of  a  molecule  will  be  specified  not  by  the  precise 
location  of  a  single  point  in  the  phase-space,  but  rather  by  a  small  region 
in  the  phase-space  within  which  the  representative  point  will  lie.  Thus 
in  the  case  of  a  molecule  having  n  "classical"  degrees  of  freedom  the 
state  would  be  specified  by  the  infinitesimal  region  dq^ ....  dpn  in  which 
the  representative  point  was  located,  while  the  state  of  a  molecule  with  n 
"quantized"  degrees  of  freedom  would  usually  be  sjjecified  by  a  finite 
region  of  magnitude  h^.  In  making  these  specifications  of  state,  regions 
in  the  phase-space  of  just  the  same  volume  will  be  used  for  the  different 
conditions  of  a  molecule  in  order  to  preserve  an  equivalence  between  the 
probabilities  of  different  possible  molecular  states. 

180.  The  Classification  of  Molecular  States. — We  shall  first  find 
it  convenient  to  introduce  a  rather  abstract  classification  of  molecular  states 
into  congruent,  enantiomorphous  and  reverse  states. 

a.  Congruent  States. — Since  a  mere  change  in  the  spatial  location 
or  orientation  of  a  molecule,  leaving  undisturbed  its  internal  configura- 
tion and  the  magnitudes  and  relative  directions  of  the  velocities  of  its 
different  parts,  will  not  in  general  alter  its  energy  content  nor  the  nature 
of  its  behavior  towards  other  molecules,  which  are  suitably  placed  or 
radiation  which  is  properly  disposed,  we  shall  often  desire  to  group 
together  such  molecules  and  shall  speak  of  them  as  being  in  congruent 
states.  Indeed  it  is  often  customary  to  si:>eak  of  molecules  as  being  in 
the  same  state  when  more  strictly  they  are  in  congruent  states. 

Our  definition  of  congruence  can  be  constructed  from  analogy  with 
the  use  of  the  term  in  geometry.  T^i'o  states  of  a  molecule  tvill  he  called 
congruent,  if  the  molecule  can  he  changed  from  one  state  to  the  other, 
by  a  movement  zvhich  does  not  alter  the  relative  positions  or  velocities 
of  its  parts  and  does  not  change  its  relation  to  any  external  field  of 
force  in  which  it  may  he  situated.  In  carrying  out  such  imaginary  move- 
ments from  one  congruent  state  to  another,  the  velocities  of  the  differ- 
ent parts  of  the  molecule  and  of  its  center  of  gravity  can  conveniently 

since  the  state  of   interest  of  the   molecule  may  be   much  more  dependent  on   its 
internal  configuration  than  on  its  proximity  to  other  molecules. 
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be  thought  of  as  geometric  vectors,  rigidly  attached  to  the  molecule,  so 
that  they  will  partake  in  the  translation  and  rotation  of  the  molecule 
as  a  whole.  Congruence  for  molecular  physics  thus  means  the  same  as 
congruence  for  geometry  with  the  additional  restrictions  as  to  velocities 
and  external  field. 

b.  Enantiomorphous  States. — It  is  also  possible  to  generalize  the  idea 
of  enantiomorphous  states.  Tinfo  states  of  a  molecule  will  he  called  en- 
antiomorphous when  one  of  the  states  is  congruent  with  the  mirror  image 
of  the  other.  This  idea  of  enantiomorphous  states  is  of  course  a  com- 
mon one  in  organic  chemistry  and  is  here  merely  extended  to  include 
the  idea  of  the  reflection  of  the  vectors  giving  the  velocities  of  the  parts 
of  the  molecule  as  well  as  those  that  determine  their  relative  ix)sition. 
It  is  interesting  to  note  that  this  addition  leads  to  enantiomorphism  in 
the  case  of  molecules  which  do  not  contain  any  asymmetric  atom. 

c.  Reverse  States. — In  addition  to  the  ideas  of  congruent  and  enanti- 
omorphous states  adapted  from  geometry,  we  must  add  for  the  purposes 
of  physics,  the  further  idea  of  reverse  states,  which  difl^er  by  a  re- 
versal in  direction  of  the  components  of  velocity.  Tzt/o  states  of  a  m^ole- 
cule  will  he  called  reverse,  if  the  molecule  can  he  changed  from  one  of 
the  states  to  a  state  congruent  with  the  other  hy  a  reversal,  in  direction, 
of  all  components  of  velocity  ztnthout  change  in  magnitude  or  change 
in  the  relative  positions  of  the  constituents  of  the  molecules. 

The  possibility  of  the  existence  and  stability  of  molecules  in  reverse 
states  is  guaranteed  by  the  principle  of  dynamical  reversibility  to  which 
we  shall  now  turn  our  attention. 

181.  The  Principle  of  Dynamical  Reversibility. — The  possibility 
of  the  "time  reversal"  of  any  dynamical  motion  is  a  well-known  me- 
chanical principle.  The  proof  of  this  principle,  that  the  reverse  of  any 
given  motion  of  a  system  is  also  a  possible  motion  of  the  system,  can 
easily  be  obtained  if  we  consider  the  equations  of  motion  in  the  Lagrangian 
form   (see  Chapter  2,  Section  13) 

.4i-^  =  0'  .•=1,2,3...,,  (319) 

Rewriting  these  equations  so  as  to  show  the  dependence  of  the  Lagrangian 
function  L   and  the  external   forces   Qi  on   the   generalized   coordinates 

Qi . . .  .  Qn  and  generalized  velocities  —7—  ....  -j--  we  obtain 
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r 

Since  in  general,  however,  L  is  quadratic  in  the  velocity  terms  —j-,  equa- 
tion  (320)  can  obviously  be  rewritten  in  the  form 
d  d 

[d{—t)i 
d 


Hence  if 


qi=<f>i(t)  (322) 


is  a  solution  of  the  equations  of  motion,  it  is  evident  from  the  identity 
of  the  forms  of  the  two  equations  (320)  and  (321)  that 

qi  =  <j>i(—t)  (323) 

is  also  a  possible  solution.  The  result  means  that  if  at  any  given  instant 
we  reverse  the  direction  of  all  the  velocities  of  such  a  dynamical  system, 
it  will  then  retrace  its  former  path,  having  at  each  instant  on  the  reverse 
path  the  same  position  and  reverse  direction  of  its  velocities  as  at  the 
corresponding  time  in  the  original  motion. 

182.  Limitations  to  the  above  "principle  of  reversibility"  will  obviously 
arise  if  the  Lagrangian  function  L  or  the  external  forces  Qi  should  con- 
tain terms  of  the  first  or  other  odd  degree  in  the  velocities.  An  interest- 
ing example  of  this  occurs  in  the  case  of  the  motion  of  a  charged  particle 
in  an  external  magnetic  field.  Here  a  reversal  in  the  direction  of  that 
component  of  velocity  which  is  perpendicular  to  the  field,  will  lead  to  a 
reversal  in  the  direction  of  the  force  on  the  particle  and  thus  exclude 
the  possibility  of  the  reverse  motion.  Thus,  as  has  been  pointed  out 
in  this  connection  by  Einstein  and  Ehrenfest,  if  we  consider  a  Bohr 
hydrogen  atom  in  an  external  magnetic  field  with  the  electron  rotating 
in  a  plane  perpendicular  to  the  field,  the  time  reversal  of  this  motion 
will  not  be  possible  since  the  magnetic  force  will,  for  one  direction  of 
rotation  be  acting  towards  the  nucleus,  and  for  the  other  direction  of 
rotation  away  from  it.  Hence  one  of  the  orbits  will  have  a  different 
velocity  of  rotation  than  the  other. 

Special  attention,  however,  should  be  called  to  the  possibility  and  in- 
deed perhaps  propriety  of  including  in  the  time  reversal,  a  reversal  in 
the  direction  of  the  electric  current  which  produces  the  external  magnetic 
field.  Under  these  conditions  the  complete  time  reversal  of  the  original 
motion  can  then  be  secured. 
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183.  For  our  present  purposes,  the  thing  of  interest,  arising  from 
the  above  considerations,  is  to  note  that  in  the  absence  of  an  external 
field,  the  reverse  of  any  state  of  a  molecule  is  also  a  dynamically  pos- 
sible state.  In  addition  it  is  easy  to  see  that  if  one  of  the  states  is 
quantized,  then  the  other  v^ill  also  be  quantized  with  the  same  quantum 
numbers,  since  if  for  the  one  state  vj&  have 


(p  pidqi  =  I   piQidt  =  nji, 


0 

then  for  the  other  state  we  shall  have 

t'  -t  t 

(fpi'dqi'=    ( pi'qi'dr  =    (  i—pi)  (—qi)d(~i)=Cpiqidt  =  nill. 
•^0                                  0  0 

We  may  conclude  that  the  reverse  of  any  state  of  a  molecule  can  also 
exist,  and  indeed  in  a  system  which  has  come  to  equilibrium  we  shall 
expect  an  equal  number  of  molecules  in  each  of  any  pair  of  reverse  states. 

184.  On  the  Congruence  of  Reverse  States. — The  nature  and  be- 
havior of  reverse  states  is  going  to  be  very  important  for  our  later  work. 
In  particular  we  must  consider  whether  or  not  such  states  are  congruent, 
and  may  say  in  advance  that  reverse  states  are  apparently  sometimes 
congruent  and  sometimes  not  congruent  with  each  other,  depending  on 
the  complexity  of  the  molecule  involved.  In  addition  in  many  cases, 
although  perhaps  not  necessarily  in  all,  a  reverse  state  which  is  not 
congruent  with  the  original  state  may  become  so  at  a  later  time  through 
elastic  collisions  or  through  the  rotations  which  are  taking  place  in  the 
molecule.  i  S    i 

185.  As  a  first  example  let  us  take  a  simple  spherical  electron  moving 
with  a  definite  velocity  in  a  given  direction.  Here  evidently  the  reverse 
state,  obtained  by  reversing  the  direction  of  motion,  will  be  entirely 
congruent  with  the  original  state,  owing  to  the  symmetry  of  the  electron 
itself. 

186.  Proceeding  to  a  more  complicated  case  where  the  molecule  does 
not  have  spherical  symmetry,  let  us  next  consider  for  example  a  chloro- 
form molecule,  CH3CI,  with  the  chlorine  atom  and  the  three  hydrogen 
atoms  arranged  approximately  at  the  corners  of  a  tetrahedron,  and  the 
whole  molecule  moving  say  in  the  direction  of  the  chlorine  atom.  Here 
it  is  evident,  even  neglecting  the  effect  of  the  reversal  of  the  motion 
of  the  electrons  inside  the  atoms,  that  the  reverse  state  will  not  be  con- 
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gruent  with  the  original  state,  since  in  one  case  the  molecule  as  a  whole 
will  be  moving  in  the  direction  of  the  chlorine  atom  and  in  the  other 
case  in  the  opposite  direction.  Moreover,  this  reversal  might  have  im- 
mediate physical  consequences,  since  a  collision  with  the  chlorine  atom 
ahead  could  certainly  produce  different  results  from  one  with  the  hydro- 
gen atoms  in  the  lead.  Nevertheless,  it  must  be  noted  that  through  elastic 
collisions  with  other  molecules,  or  by  rotation  around  an  axis  perpen- 
dicular to  the  line  joining  the  chlorine  atom  to  the  center  of  gravity  of 
the  molecule,  the  new  molecule  can  assume  a  state  congruent  with  the 
original  one.  Since  numerous  collisions  always  do  take  place  in  the 
systems  of  interest,  and  since  in  addition  the  Occurrence  of  half  rota- 
tional quantum  numbers  makes  it  possible  that  molecules  are  necessarily 
rotating  even  at  the  lowest  temperatures,  we  must  hence  conclude  that 
the  lack  of  instantaneous  congruence  after  reversal  of  the  direction  of 
motion  will  have  no  effect  on  the  long  run  behavior  of  the  chloroform 
molecule.  In  general  we  conclude  that  any  lack  of  congruence  in  a  re- 
versed molecule,  due  to  the  change  in  the  direction  of  the  motion  as  a 
whole,  will  have  in  the  long  run  no  effect  on  the  physical  behavior  of 
the  molecule. 

187.    Turning  our  attention  next  to  the  effect  of  the  reversal  of  the 
internal  motions   of   the  electrons  inside  of   molecules,   let  us   first  take 
a  simple  Bohr  hydrogen  atom  with  the  electron  in  a  circular  orbit,  the 
atom  as  a  whole  having  a  velocity,   for  example,  in  the  direction   from 
the  nucleus  to  the  instantaneous  position  of  the  electron.     Consider  now 
the  reverse  state,  obtained  by  reversing  the  direction  of  rotation  of  the 
electron  and  reversing  the  direction  of  motion  of  the  atom  as  a  whole. 
By  rotating  this  reverse  atom  through  180°  around  an  axis  lying  in  the 
plane  of  the  orbit  perpendicular  to  the  direction  of  motion  of  the  atom, 
it  is  evident  that  we  give  this  new  atom  the  same  direction  of  motion  as 
a  whole  as  the  original  atom  and  also  give  its  electron  the  same  direc- 
tion of   rotation  in  the  orbit.     The  two  atoms,  however,  are  not  con- 
gruent,   since   the   original   atom   is    moving   in    the    direction   of    a    line 
from  nucleus  to  electron,  and  the  new  atom  in  the  direction  of  a  line  from 
electron  to  nucleus.    Here  too  at  first  sight  this  lack  of  congruence  might 
seem  to  be  of  physical  importance,  since  a  collision  with  the  nucleus  in 
advance  might  have  very  different  consequences  from  one  with  the  elec- 
tron in  advance.     It  must  be  noted,  however,  that  the  new  atom  will 
evidently  change  to  a  state  congruent  with  the  original  state  of  our  first 
atom,  provided  only  we  wait  for  a  half  period  of  rotation  to  complete 
itself.  •  Hence,  if  we  consider  the  long  run  behavior  of  the  two  atoms, 
the  lack  of  instantaneous  congruence  is  again  of  no  physical  significance. 
Indeed  it  might  be  more  in  accord  with  the  spirit  of  modern  quantum 
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theory  to  disregard  any  mention  of  instantaneous  positions  in  an  orbit, 
since  the  transfer  of  our  ordinary  concepts  of  space  and  time  to  describe 
the  internal  behavior  of  an  atom  may  not  be  justified. 

188.  Although  in  the  case  of  the  simple  Bohr  hydrogen  atom,  the 
reverse  of  any  state  is  congruent  with  the  original  state,  at  least  after 
a  proper  time  interval,  it  is  more  difficult  to  be  sure  as  to  what  would 
be  true  in  this  respect  in  the  case  of  more  complicated  atoms.  Consider, 
for  example,  an  atom  with  a  number  of  electrons  rotating  in  approxi- 
mately plane  elliptical  orbits,  and  for  simplicity  take  the  atom  as  a  whole 
at  rest.  It  is  quite  evident  that  the  immediate  result  of  reversing  the 
direction  of  motion  of  all  the  electrons  in  such  an  atom  will  not  be  to 
produce  a  congruent  state,  since  if  we  choose  any  pair  of  orbits  in  the 
reversed  atom  and  turn  the  atom  over  through  180°  around  the  line  of 
intersection  of  the  planes  of  these  two  orbits,  we  shall  indeed  rectify 
the  directions  of  rotation  in  these  two  orbits,  but  shall  in  general  put 
the  planes  of  the  other  orbits  in  positions  which  are  not  parallel  with 
their  old  positions  in  the  original  atom,  and  in  addition  shall  have  the 
further  difficulty  that  electrons  in  the  original  atom  which  are  moving 
towards  perihelion  will  in  the  reversed  atom  be  moving  away  from  peri- 
helion and  vice  versa.  For  an  atom  with  a  number  of  electron  orbits 
it  is  hence  evident  that  at  the  best  the  congruence  of  the  reverse  state 
will  only  be  achieved  after  a  suitable  time  interval. 

To  investigate  if  the  changes  in  state  occurring  in  an  atom  might 
lead  at  a  later  time  to  such  congruence,  we  may  recall  that  the  interpreta- 
tion of  spectral  levels  and  their  fine  structure  in  accordance  with  the 
Bohr  correspondence  principle,^  leads  to  the  belief  that  the  electrons  in  a 
general  atom  can  be  regarded  as  moving  in  approximately  plane  elliptical 
orbits,  with  a  precession  of  the  perihelion  in  the  plane  of  the  orbit  and 
in  addition  a  precession  of  the  plane  of  the  orbit  itself.  Hence,  if  this 
be  the  case,  considering  any  single  given  orbit  in  the  reversed  atom,  it  is 
evident  that  in  the  course  of  time  it  will  adjust  both  the  direction  of 
its  perihelion  and  direction  of  rotation  to  that  of  the  corresponding  orbit 
in  the  original  state.  This  means  of  course  that  in  the  absence  of  any 
relation  between  the  periods  of  the  precessions  for  different  orbits,  the 
reversed  atom  would  in  course  of  time  come  into  a  state  congruent  with 
the  original  state  of  the  first  atom.  It  seems  possible,  however,  that 
there  could  be  an  interlocking  between  different  orbits,  which  would  in- 
troduce relations  between  the  periods  of  precession,  which  would  prevent 
the  reversed  atom  from  ever  assuming  through  such  precessions  a  state 
congruent  with  the  original.  Such  interlocking  might  also  prevent  the 
occurrence  of  the  necessary  changes  through   ordinary   elastic   collision, 

*Bohr,  Ann.  dcr  Physik,  71,  248  (1923). 
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especially  since  the  inner  X-ray  levels  would  hardly  be  affected  by  ordi- 
nary collisions. 

189.  It  must  apparently  be  concluded  that,  in  the  present  state  of 
atomic  theory,^  we  cannot  assert  that  the  reversed  state  of  an  atom  will, 
through  ordinary  collision  or  its  own  internal  motions,  always  necessarily 
become  congruent  with  the  original  state.  We  have  of  course  in  this 
matter  the  slight  experimental  evidence  that  such  a  difference  between 
reversed  atoms  has  never  been  detected,  but  the  separation  and  detection 
of  the  two  forms  if  they  do  exist  would  be  an  almost  insuperable  problem. 
Under  the  circumstances,  even  though  it  will  complicate  our  later  con- 
siderations, we  shall  proceed  as  much  as  possible  on  the  basis  that  the 
reverse  state  of  a  molecule  is  not  necessarily  equivalent  to  the  original 
state. 

190.  Changes  of  Molecular  State  through  Collision. — ^In  general 
any  far-reaching  change  in  the  state  of  a  molecule  involves  a  change 
in  its  energy  content  which  can  of  course  only  result  through  collision 
with  other  molecules  or  interaction  with  radiation.  We  shall  treat  the 
case  of  collision  first. 

a.  Definition  of  Collision  and  Constellation. — Following  the  procedure 
of  Boltzmann,  it  will  be  convenient  to  consider  that  a  collision  occurs 
when  the  centers  of  gravity  of  the  two  molecules  involved  approach  within 
a  certain  set  distance  b,  which  is  chosen  large  enough  so  that  the  mutual 
interaction  of  the  molecules  when  they  are  farther  apart  than  this  may 
be  regarded  as  negligible.  At  any  instant  during  the  course  of  the  col- 
lision we  shall  designate  the  state  of  the  two  molecules  as  a  constellation. 

b.  Classi£cation  of  Constellations. — We  shall  use  the  terms  congruent, 
cnantiomorphous  and  rez^erse  to  classify  different  constellations  in  the 
same  sense  as  for  single  molecules.  In  addition  in  the  case  of  constella- 
tions we  shall  need  two  further  categories. 

We  shall  use  the  term  critical  constellation  to  designate  the  state  of 
the  system  of  two  molecules  at  the  beginning  or  at  the  end  of  the  col- 
lision when  the  distance  apart  of  the  two  centers  of  gravity  assumes 
exactly  the  value  b.  Further  two  constellations  will  be  designated  as  cor- 
responding constellations  when  they  differ  only  in  an  interchange  in  the 
positions  of  the  centers  of  gravity  of  the  two  molecules,  leaving  unchanged 
all  the  velocities  and  the  values  of  those  coordinates  which  determine  the 
internal  configurations  of  the  molecules. 

It  will  be  convenient  to  designate  different  constellations  by  symbols 
of  the  form  (a,  b)  where  the  letters  inside  the  bracket  refer  to  the  states 

'  The  later  development  of  atomic  theory  may  well  make  use  of  the  new 
rnechanics  of  Heisenherg,  Born  and  others  rather  than  the  quasi-classical  mechanics 
hitherto  used.     See  Chapter  7. 
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of  the  two  molecules  involved.  The  corresponding  constellation  to  (a,  b) 
will  be  designated  by  the  symbol  (b,  a)  and  the  reverse  constellation  by 
the  symbol  ( — a,  — b). 

If  (a,  b)  designates  a  critical  constellation  occurring  at  the  beginning  of 
a  collision,  it  is  evident  that  the  corresponding  constellation  {b,  a)  and  the 
reverse  constellation  ( — a,  — b)  are  both  of  them  critical  constellations 
with  which  a  collision  might  end.  Furthermore  it  should  be  noted  that 
we  can  obtain  the  totality  of  all  the  final  critical  constellations  with  which 
a  collision  can  end,  if  we  consider  the  totality  of  all  initial  constellations 
and  then  construct  either  all  the  corresponding  constellations  or  all  the 
reverse  constellations. 

c.  Classification  of  Collisions. — The  different  possible  kinds  of  col- 
lision which  can  occur  can  be  succinctly  described  by  specifying  the  critical 
constellations   with   which   the   collision   begins   and   ends,   and   we   shall 


denote  these  different  kinds  of  collision  by  symbols  of  the  form 


/a,  b\ 


where  (a,  b)  is  the  initial  and  (c,  d)  the  final  critical  constellation. 

In  classifying  diiTerent  kinds  of  collisions  we  shall  use  the  terms 
concp^uent,  cnantiopharous,  and  reverse  in  the  evident  sense.  We  shall 
also  introduce  the  further  terms  corresponding  collision  and  inverse 
collision. 

. .       ■ .    /«^  ^^   , 

Considering  any  given  origmal  collision    I         ,  I  ,  the  corresponding 

collision  will  be  defined  as  one  which  starts  with  the  initial  constellation 
(d,  c)  which  corresponds  to  the  final  constellation  (c,  d)  resulting  in 
the  first  collision,  and  can  be  derived   from    {c,  d)    by  interposing  the 

/d,  c\ 

positions  of  the  centers  of  gravity  of  the  two  molecules.     Thus    I  j 

.  .  /a,  b\ 

is  the  corresponding  collision  to  I         ,1. 

/a,  b\ 
Considering  again  some  given  original  collision  I  I ,  we  shall  often 

be  specifically  interested  in  methods  of  getting  the  two  molecules  back  into 

states  congruent  with  their  original  states  a  and  b.     Such  a  collision  will 

be  called  the  inverse  collision  to  the  original  collision  and  could  be  desig- 

/C,  d'\     .  . 

nated   by   the   symbol    I     /    ; ,  )     if   desired,   where  the   primes    indicate 

the  congruence  rather  than  identity  of  states  involved  in  the  two  collisions. 

fa,  b\ 

It  is  evident  that  for  any  given  collision    I         ,1    there  always  exist 

congruent  collisions   i    ,     r,  )  and  the  corresponding  collision    I     '      J . 
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The   existence   of   the   reverse  colHsion    I         '        ,1     and   the   inverse 

\~a,  —bj 


colHsion   I     ,    r,  )    must  be  given  more  special  consideration. 

191.  Changes  of  Molecular  State  through  Interaction  with 
Radiation. — Changes  in  the  state  of  a  molecule  may  occur  not  only  by 
collision  with  other  molecules  but  also  by  the  absorption  or  emission  of 
radiation.  A  complete  and  correct  description  of  the  mechanism  of  the 
absorptive  and  emissive  processes  cannot  now  be  given,  and  we  shall 
have  to  wait  for  such  an  account  until  the  science  of  physics  has  solved 
the  difficult  and  baffling  problem  of  the  interrelation  between  the  quantum 
theory  and  the  wave  theory  of  light.  For  our  present  purposes,  how- 
ever, we  can  schematically  represent  the  absorptive  process  by  which  an 
atom  or  molecule  A  is  raised  from  a  given  quantum  state  Ai  to  a  higher 
state  Aj,  by  the  expression 

hvi,+Ai  >  A  J  (324) 

where  v  is  the  frequency  of  the  radiation  which  leads  to  the  absorption, 
hv  is  the  energy  increase,  and  the  subscrift  ij  is  intended  to  indicate 
that  the  active  radiation  has  not  only  the  right  frequency,  but  all  other 
characteristics  such  as  proper  polarization,  phase,  spatial  distribution  and 
so  forth  which  may  be  necessary  for  the  phenomenon  of  absorption.* 

If  we  turn  now  to  the  emissive  process  by  which  the  molecule  can 
drop  from  quantum  state  ;  to  state  i,  it  is  evident  that  the  radiation 
accompanying  the  emission  will  have  the  same  frequency  as  that  previously 
active  in  the  absorption  process,  but  in  our  present  state  of  partial 
knowledge  it  cannot  definitely  be  stated  that  all  the  properties,  in  par- 
ticular the  spatial  distribution  of  the  two  radiations,  are  the  same.  For 
this  reason  it  will  be  safest  to  designate  the  emission  process  by  the 
formula 

Aj  >  Ai  +  hvji  (325) 

where  the  transposition  of  the  subscripts  from  the  order  ij  to  ji  indicates 
that  we  are  not  asserting  an  identity  in  the  radiation  active  in  emission 
and  absorption  except  as  to  frequency. 

*  The  more  general  case  of  interaction  between  radiation  and  molecules,  in 
which  the  radiation  is  not  completely  absorbed,  as,  for  example,  in  the  Compton 
effect,  could  be  designated  by  an  expression  of  the  form 

hvij  +  Ai^Aj  +  hvji  (324)' 

In  addition  cases  of  a  combined  collisional  and  radiational  nature  must  sometimes 
be  considered,  for  example  the  capture  of  an  electron  by  an  ionized  atom 

Na^  +  E-^Na  +  h^  (324)" 
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Just  as  in  the  case  of  collisions  we  shall  wish  to  speak  of  congruent, 
enantiomorphous,  reverse,  corresponding,  and  inverse  interactions  with 
radiation.  In  analogy  with  our  previous  usage,  it  is  evident  that  the 
emissive  process  given  by  expression  (325)  would  be  the  corresponding 
process  to  the  absorption  given  by  (324).  The  reverse  process  to  (324) 
could  be  designated  by  the  formula 

A.j  >  A.i  +  hv.i.j  (326) 

and  the  inverse  process,  if  it  exists,  by  the  formula 

Aj  >  Ai  +  hvij  (327) 

where  the  change  in  the  order  of  the  subscripts  from  that  in  expression 
(325)  indicates  the  complete  identity  of  the  emitted  radiation  with  that 
absorbed  in  process  (324). 

192.  On  the  Occurrence  o£  Congruent  Processes. — In  the  fore- 
going sections  we  have  described  both  in  the  case  of  collisional  and 
radiational  interactions  what  we  mean  by  congruent,  enantiomorphous, 
reverse,  corresponding,  and  inverse  processes,  and  must  now  consider  the 
possibility  and  frequency  of  occurrence  of  such  processes. 

It  is  at  once  evident  that  there  is  no  limitation  to  the  occurrence 
of  molecular  processes  which  are  congruent  with  one  another,  since  by 
a  mere  transformation  of  coordinates  the  equations  describing  two  con- 
gruent processes  assume  the  same  form.  Furthermore,  since  we  have 
adopted  the  convention  of  always  using  the  same  volume  of  phase  space 
(jM-space)  dqi  ....  dpn  or  li^  in  specifying  the  state  of  a  molecule,  it  is 
evident  that  in  a  large  system  which  has  come  to  equilibrium  the  prob- 
ability for  the  occurrence  of  any  molecular  process  is  the  same  as  for 
any  process  congruent  with  it.  For  this  reason  we  often  speak  of  con- 
gruent processes  as  being  the  same  process. 

193.  On  the  Occurrence  of  Enantiomorphous  Processes. — Except 
in  the  field  of  organic  chemistry  we  are  not  usually  specially  interested 
in  enantiomorphous  states  and  enantiomorphous  processes.  It  is  interest- 
ing to  note,  however,  that  in  a  system  which  has  come  to  complete 
"racemic"  equilibrium  we  may  also  expect  the  same  probability  of  occur- 
rence for  enantiomorphous  processes. 

194.  On  the  Occurrence  of  Reverse  Processes. — The  actual  pos- 
sibility for  the  occurrence  of  reverse  collisions  and  reverse  absorptions 
and  emissions  is  made  certain  by  the  principle  of  dynamical  reversibility 
which  we  have  already  discussed  in  an  earlier  section  of  this  chapter, 
since  in  the  absence  of  an  external  magnetic  field,  this  principle  guar- 
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antees  the  possibility  of  the  time  reversal  of  any  internal  molecular  process 
which  may  take  place. 

Furthermore,  in  a  system  containing  a  large  number  of  molecules 
of  each  of  the  kinds  involved,  we  must  evidently  expect,  when  equi- 
librium has  been  attained,  that  the  rate  at  which  any  molecular  process 
is  taking  place  is  equal  to  the  rate  at  which  the  reverse  process  is  taking 

place.     Thus  at  equilibrium  the  collision    I        j]    will  occur  with  the 


/-c,-d\ 
\—a,  —bj- 


same  frequency  as  the  reverse  collision    I  '  I .     And  the  absorp- 

tion JiVij  -j-  Ai  =z  Aj  will  occur  with  the  same  frequency  as  the  emission 
A.j  =  A.i  -f  hv^i^j. 

It  should  be  especially  noted,  however,  that  the  rczKrse  of  a  given 
process  does  not  leave  the  molecules  involved  in  the  state  which  they 
had  at  the  start  of  the  original  process,  but  rather  in  the  reverse  of 
that  state,  a  fact  of  considerable  significance  in  connection  with  our  later 
consideration  of  the  mechanisms  by  which  a  system  in  equilibrium  always 
maintains  a  constant  number  of  molecules  in  any  given  state. 

195.  On  the  Occurrence  of  Corresponding  Processes. — The  pos- 
sibility of  the  occurrence  of  corresponding  processes  is  also  evident.     If 

(a,  b\  ...  /d,  c\ 

,  j  ,  the  corresponding  collision,  say  I  l> 

must  certainly  be  possible,  since  it  will  occur  by  definition  when  mole- 
cules in  the  states  d  and  c  approach  each  other  at  the  proper  angle  within 
the  distance  h,  where  appreciable  interaction  can  begin.  Also  if  we  con- 
sider that  hvij  -\-  Ai-^  Aj  is  a  possible  absorption,  the  occurrence  of 
the  corresponding  emission' z:^; —> /^i -|- ^^^;i  is  guaranteed  by  the  corre- 
spondence principle,  since  on  the  basis  of  the  classical  theory  the  possibility 
of  absorption  and  emission  go  hand  in  hand. 

196.  In  the  case  of  corresponding  collisions,  a  special  theorem  of 
interest  and  importance  for  our  later  work  has  been  demonstrated  by 
Boltzmann,  which  we  may  call  the  theorem  of  the  cycle  of  corresponding 
collisions  and  shall  now  present. 

Consider  the  series  of  collisions 

/2,  1\  /4,3\  /6,5\  /k  —  3,k  —  4\/k  —  l,k  —  2\/k-{-l,k         \ 

\3,4)  \S,6)  \7,8) \k  —  2,k—l)\k,k  +  l         J\k  +  2,k  +  2)  ■-' 

where  the  integers  denote  states  of  the  molecules,  and  each  successive  col- 
lision corresponds  to  the  preceding  one,  that  is  starts  with  the  two  mole- 
cules in  the  states  in  which  they  were  left  by  the  preceding  collision, 
but  with  their  centers  of  gravity  transposed. 
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Let  us  now  define  the  different  states  of  our  molecules  1,  2,  3,  4,  5,  6, 
etc.,  by  assigning  to  each  state  a  specified  region  in  the  phase  space 
(//-space),  which  has  a  volume  small  enough  so  that  we  can  regard 
the  physical  behavior  of  the  molecule  as  determined  by  the  location  of  , 
the  specified  region,  but  a  volume  which  is  nevertheless  finite.  This  pro- 
cedure, which  introduces  no  theoretical  incorrectness,  has  the  advantage 
of  giving  us  only  a  finite  number  of  possible  states  to  consider. 

Returning  now  to  the  series  of  collisions  given  above,  it  is  evident, 
since  we  have  only  a  finite  number  of  possible  states,  that  by  extending 
the  series  to  the  right  we  must  at  last  come  to  a  collision  in  which  the 
final  states  of  the  two  molecules  are  the  same  as  the  initial  states  for 
some  earlier  collision,  that  is  to  a  collision  to  which  some  earlier  collision 
corresponds.  We  shall  now  prove  that  this  earlier  coUision  which  corre- 
sponds to  the  last  collision  of  the  series  must  as  a  matter  of  fact  be  the 

first  collision  of  the  series  \     '    .  ]. 

To  demonstrate  this,  let  us  assume  that   (  ,    ,    o'    ,    ,    ->  I     is  the  last 

\k-\-2,  k-\-3/   ^ 

collision  of  the  series  and  that  the  earlier  collision  corresponding  to  it  is 

/6,  5\ 

not  the  first  collision  of  the  series  but,  for  example,  a  later  one  I  -    „  I' 

/k-\-l,  k'      \ 
We  have  then  from  this  correspondence  that  the  collision  (7_i_9    fci-?/ 

_     '       J     or,  since  the  end  states  are  identical, 

.  .         /4,  3\ 
this  collision  assumed  as  the  last  must  actually  be  the  collision     I  r    zr  j 

,.        .  ,  .         /k—l,  k  —  2\ 

and  preceding  it   in  the  series  the  colhsiun     (  /,    i,  _i_  i  )     must  be 

/2,  1\  ,  .    \      /k  —  3,  k  —  4\ 

\  1    A  I  "^^^^  ^^^^  ^^^^^  '''^^^  member  of  the  series  be  I  ^    ^  j  as 

was  to  be  demonstrated. 

This  theorem,  that  the  series  of  corresponding  collisions  between  two 
molecules  form  a  closed  cycle,  will  be  of  use  to  us  when  we  later  come 
to  the  demonstration  of  Boltzmann's  famous  //-theorem. 

197.  It  would  be  of  interest  to  inquire  whether  a  similar  theorem 
holds  for  corresponding  processes  in  the  case  of  the  absorption  and  emis- 
sion of  radiation.  Here,  however,  we  shall  be  on  difficult  ground  until 
physics  has  solved  the  problem  of  the  nature  of  radiation.  Nevertheless, 
if  we  take  the  position  of  the  extreme  light  quantum  theory  and  assume 
that  the   nature  of   a  quantum   of   light  is  uniquely   determined   by   the 
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initial  and  final  states  of  the  atom  which  emits  or  absorbs  it,  we  can  con- 
clude that  the  quantum  hva  absorbed  by  the  process 

hvij  +  Ai  >  Aj 

is  identical  with  the  quantum  hvji  given  out  by  the  emission 

Aj  >  Ai  +  hvii 

and  hence  that  these  two  steps  alone  would  then  form  a  cycle  of  corre- 
sponding processes. 

198.  There  is  a  further  theorem  with  respect  to  corresponding  col- 
lisions which  we  must  now  deduce.  This  may  be  called  the  theorem  of 
equal  probahility  coefficients  for  corresponding  collisions.  The  proof 
may  be  obtained  as  follows. 

To  specify  completely  the  state  of  a  molecule  we  must  give  the  region 
in  the  phase  space  (^-space)  dq^dq-ydq-j,  dq^  ....  dqnSpi ....  dpn'm  which 
its  representative  point  falls.  The  first  three  coordinate  ranges  may  be 
taken  as  determining  the  location  of  the  center  of  gravity  of  the  mole- 
cule and  the  remaining  ranges  as  determining  the  values  of  its  internal 
coordinates  and  its  external  and  internal  momenta.  In  considering  dif- 
ferent kinds  of  collisions,  however,  we  are  usually  not  interested  in  the 
particular  location  in  space  where  the  collision  occurs,  but  desire  to  know 
merely  the  values  of  the  internal  coordinates  and  the  external  and  in- 
ternal momenta  for  a  molecule  that  is  entering  a  collision.  Hence  for 
such  purposes  we  shall  specify  the  state  of  a  molecule  by  giving  merely 
the  range 

dco  =  dqi  .  .  .  dqndpi  .  .  .  dpn  (328) 

in  which  the  internal  coordinates  and  the  external  and  internal  momenta 
lie.  Furthermore  we  shall  choose  equal  values  of  dco  for  specifying  differ- 
erent  possible  states  of  a  molecule. 

Consider  now  a  system  containing  Ni,  N2,  N3  and  A^4  molecules  in  the 
states  given  by  the  ranges  dcoi,  60)2,  Su>s  and  60)4,  and  let  the  molecules  in 
the  different  states  be  uniformly  distributed  throughout  the  total  volume  V 
of  the  system.     Furthermore  let  the  states  be  so  chosen  that  we  can  have 

a  collision  I  ^'   .  1    between  molecules  in  states  1  and  2  leading  to  mole- 


e:) 


cules  in  states  3  and  4.  Since  the  molecules  in  the  different  states  are 
uniformly  distributed  throughout  the  system,  the  chance  of  such  a  col- 
lision will  be  proportional  to  the  numbers  of  molecules  in  the  two  states, 
so  that  we  may  write  for  the  probable  number  of  such  collisions  occurring 
per  second  the  expression 

^l:  1=^1:1  ^1^'^  (329) 
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where  C"'  ^  is  a  probability  coefficient.     Similarly  for  the  number  of  corre- 

/4,  3\ 

sponding  collisions  of  the  type   I  -    f:  J   "^^  ^^^y  write 

^1:1=^11^^^^  (330) 

where  Ct'  ^  is  another  such  coefficient.  With  the  help  of  Liouville's 
theorem  we  shall  now  show,  however,  that  these  two  coefficients  are  equal. 

Let  us  follow  the  course  of  the  collision  I     '  ^^  )  by  the  motion  of  a 

representative  point  in  a  combined  phase-space  for  the  two  molecules, 
of  such  a  nature  that  the  position  of  the  representative  point  will  give 
us  the  location  of  the  center  of  gravity  of  the  first  molecule  relative  to 
that  of  the  second  and  give  us  the  ranges  dcoi  and  dcoz  for  the  internal 
coordinates  and  external  and  internal  momenta  of  the  two  molecules. 

At  the  start  of  the  collision  when  the  distance  apart  of  the  two  centers 
of  gravity  is  b,  let  the  center  of  gravity  of  the  first  molecule  He  in  a 
range  b^dXds,  where  ds  is  a  distance  laid  off  along  the  line  connecting 
the  centers  of  gravity  of  the  two  molecules  and  dX  is  a  solid  angle  drawn 
from  the  center  of  gravity  of  the  second  molecule,  the  quantity  b'dXds 
thus  being  a  small  volume.  Similarly  at  the  close  of  the  collision,  when 
the  distance  apart  of  the  two  centers  again  becomes  b,  let  the  center  of 
gravity  of  the  first  molecule  lie  in  a  similar  range  b'^dAdS.  Neglecting 
then  the  position  of  the  center  of  gravity  of  the  second  molecule,  which 
would  merely  tell  us  the  particular  location  in  space  where  the  collision 
takes  place,  our  representative  point  for  the  two  molecules  will  lie  in 
the  region  ^(J0l^co2b'^^^X^s  at  the  start  of  the  collision  and  in  the  region 
dco^dcOib^dAdS  at  the  end  of  the  collision.  In  accordance  with  Liouville's 
theorem,  however,  in  the  form  giving  the  conservation  of  extension  in 
phase  as  expressed  by  equation  (31)  in  Chapter  3,  we  must  have  an 
equality  between  these  two  phase  ranges 

dcOidco2b^dX  ds  =  dco3dcOib^dA  dS  (331) 

We  have,  however,  agreed  to  use  equal  regions  Sco  in  specifying  states 
of  the  molecules  so  that  we  obtain  by  cancellation 

b^-dX  ds  =  b'^SA  dS  (332) 

Finally  since  the  choice  of  ds  and  dS  is  still  open  to  our  disposal,  provided 
we  do  not  impose  values  on  dX  and  dA,  let  us  take  them  as  equal  to 
the  changes  in  the  distance  between  centers  that  will  occur  in  a  small  time 
interval  dt,  respectively  at  the  start  and  close  of  the  collision,  in  accordance 
with  the  equations 

ds=kdt       and       dS  =  Kdt  (333) 
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where  k  and  K  are  the  components  of  the  relative  velocity  of  the  two 
molecules,  parallel  to  the  line  of  centers,  at  the  start  and  close  of  the 
collision.     Substituting  in  (332)  we  obtain 

V-dX  kdt  —  b~bA  Kdt  (334) 

We  are  now  ready  to  consider  the  probable  frequency  of  the  collisions 
in  which  we  are  interested.     For  the  probable  number  of  collisions  of  type 


Gi) 


occurring  per  second  we  shall  evidently  obtain 


Zl:\=^-^N,^^  (335) 


since  the  chance  that  a  collision  will  occur  during  any  second  is  propor- 
tional to  the  chance  that  a  molecule  in  the  first  state  has  its  center  gravity 
at  the  beginning  of  that  time  in  the  range  h'-dX  k,  and  this  in  turn  will  be 
proportional  to  the  ratio  between  this  volume  and  the  total  volume  V. 

To  determine  the  probable  number  of  corresponding  collisions    (  -'       ) 

\5,  6 J 

we  must  merely  note  that  the  component  of  relative  velocity  K  with  which 

/2,  1\    . 

the  molecules  part  after  the  collision    I  ^     .1    is,   by  the  definition  of 

corresponding  collisions,  the  same  as  that  with  which  they  approach  in 

(4,  3\ 
-'      j    and  hence  may  write  for  the  prob- 
able number  occurring  per  second  the  expression 

Zll  =  '^N,N,  (336) 

Comparing  with  our  original  equations  (329)  and  (330),  however, 
we  see  that  we  have  thus  obtained  values  for  the  probability  coefficients 
of  collision,  giving  us  the  identifications 

2,  1        b~dX  k                      4,  3        b^'SA  K 
^  i,4=  —^       and       C^^= ^^—  (337) 

and  from  equation  (334)  we  obtain  the  equality  of  these  coefficients 

Cl\=Cll  (338) 

We  have  thus  demonstrated  the  equality  of  the  coefficients  which  de- 
termine the  probable  rates  at  which  corresponding  collisions  are  taking 
place.  This  theorem  will  also  be  of  the  greatest  value  in  our  derivation 
of  Boltzmann's  H-theorem. 
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199.  On  the  Occurrence  of  Inverse  Processes. — If  we  consider  any 
process  in  which  molecules  (including  quanta  if  necessary)  change  from 
one  set  of  states  to  another,  we  have  defined  the  inverse  of  this  process 
as  one  which  starts  with  all  the  molecules  involved  in  states  congruent 
with  those  in  which  they  have  been  left  by  the  first  process,  and  returns 
them  to  their  original  states. 

The  question  of  the  possibility  and  frequency  of  occurrence  of  in- 
verse processes  is  one  of  considerable  interest,  since  it  has  sometimes 
been  assumed  that  corresponding  to  any  given  molecular  process  there 
always  exists  an  inverse  process,  and  that,  in  a  system  which  has  come 
to  equilibrium,  the  constant  quotas  of  the  molecules  in  the  different  states 
is  assured  by  an  equality  between  the  rates  with  which  every  process 
and  its  inverse  takes  place.  Of  late  years  this  postulate  has  often  been 
used  as  a  starting  point  for  theoretical  investigations.  We  shall  find, 
however,  that  in  the  simple  and  general  form  in  which  the  above  prin- 
ciple is  usually  stated  it  certainly  cannot  be  universally  true,  and  we  shall 
need  in  general  to  proceed  carefully  in  deducing  the  consequences  which 
have  usually  been  based  on  this  postulate. 

200.  The  true  state  of  affairs  can  most  easily  be  understood  by  con- 
sidering the  relation  between  inverse  processes  and  reverse  processes. 
In  the  inverse  of  a  given  process  the  molecules  must  start  in  states  con- 
gruent with  those  in  which  they  have  been  left  by  the  first  process  and 
return  to  their  original  states.  In  the  reverse  of  a  given  process,  how- 
ever, the  molecules  start  in  states  which  are  the  reverse  of  those  occurring 
at  the  end  of  the  first  process  and  change  to  states  which  are  the  reverse 
of  the  original  ones.  Hence,  whenever  the  molecules  involved  are  such 
that  any  state  and  its  reverse  are  congruent,  the  reverse  process  will 
itself  be  an  inverse  process,  and  since  the  reverse  of  any  process  is  always 
possible,  we  actually  shall  have  in  such  cases  for  every  molecular  process 
an  inverse  process. 

In  general,  however,  we  have  found  that  the  reverse  of  a  given  state 
is  not  necessarily  congruent  with  the  state  itself,  and  hence  cannot  con- 
clude in  this  way  that  an  inverse  does  exist  for  every  process. 

201.  As  an  example  of  a  process  in  which  every  molecular  state  is 
congruent  with  its  reverse,  we  may  consider  the  collision  of  sohd  elastic 
spheres.  On  account  of  their  symmetry  a  reversal  in  the  direction  of 
the  motion  of  such  spheres  will  obviously  lead  to  a  congruent  state.  Hence 
the  reverse  of  any  given  collision  will  also  be  an  invei'se  collision  and 
will  return  the  spheres  to  states  congruent  with  their  original  ones,  a  re- 
sult which  can  easily  be  verified  in  detail,  by  applying  to  such  collisions 
the  laws  of  the  conservation  of  energy  and  momentum. 


199-204]       MOLECULAR  STATES  AND  PROCESSES 


165 


202.  When  we  consider,  however,  the  colHsions  of  less  symmetrical 
bodies  we  no  longer  find  that  reverse  and  inverse  processes  are  the  same. 
To  make  this  evident  by  a  simple  example,  let  us  consider  the  somewhat 
artificial  case  of  a  collision  between  a  sphere  and  a  wedge-shaped  solid 
such  as  shown  in  Figure  5,  where  the  sphere  and  wedge  are  taken  as 
having  the  same  mass,  and  the  point  of  impact  of  the  sphere  on  the 


BtFORE    COLtl5ION 


AFTER    COLLISION 


Figure  5. 


wedge  is  taken  so  that  the  collision  will  result  in  a  complete  transfer 
of  energy  from  the  sphere  to  the  wedge.  It  is  immediately  evident  that 
no  single  collision  is  possible  which  will  return  the  sphere  and  the  wedge 
to  states  congruent  with  their  original  motions.  Such  a  return  to  the 
original  states  can  only  be  secured  by  a  series  of  collisions,  such  as  the 
cycle  of  corresponding  collisions  first  considered  by  Boltzmann  and  dis- 
cussed above  in  Section  196. 

203.  Although  the  particular  illustration  given  above  was  a  trivial  one. 
chosen  for  its  simplicity,  we  find  it  necessary  to  conclude  in  general 
that  there  may  be  molecular  processes  for  which  no  inverse  exists  at  all, 
and  hence  in  our  further  development  shall  avoid  making  use  at  all  of 
the  concept  of  inverse  processes.^ 

204.  The  Principle  of  Microscopic  Reversibility. — In  our  section 
on  the  occurrence  of  reverse  processes,  we  came  to  the  conclusion  that 
in  a  system  not  subjected  to  an  external  field  of  the  magnetic  type,  which 
would  introduce  odd  powers  of  the  velocities  into  the  Lagrangian  equa- 
tions, the  reverse  of  any  molecular  process  is  also  a  possible  process. 
Furthermore,  when  such  a  system  comes  to  a  state  of  equilibrium  it  is 
evident  that  the  rate  at  which  any  molecular  process  is  taking  place  is 
equal  to  the  rate  at  which  the  reverse  process  is  taking  place.  We  shall 
call  this  conclusion  the  principle  of  microscopic  reversibility. 

In  more  specific  language,  if  i,  j,  k,  I,  etc.,  are  states  of  molecules  in 
the  system  and  — i,  — /,  — k,  — I,  etc.,  the  reverse  of  these  states,  this 
principle  requires,  when  equilibrium  has  been  reached,  that  the  num- 
ber of  processes  per  second  in  which  molecules  change  from  i  to  ;  and 
k  to  /,  etc.,  must  be  equal  to  the  number  of  reverse  processes  in  which 


Compare  Lorentz,  Situngsher.  d.  Akad.  d.  VViss.  cu  Wien,  2  abt.,  95,  115  (1887). 
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molecules  change  from  —  /  to  —  i  and  —  /  to  —  k,  etc.  As  already 
noted,  this  principle  as  just  stated  in  its  correct  form  must  be  carefully 
distinguished  from  the  incorrect  statement  that  the  number  of  processes 
characterized  by  i  to  /,  k  to  /,  etc.,  is  necessarily  equal  to  the  number 
j  to  i,  I  to  k,  etc.,  since  the  inverse  processes  may  not  even  exist. 

There  are  a  number  of  corollaries  to  this  principle  which  it  will  be 
well  for  us  to  consider. 

(a)  If  we  denote  the  rate  at  equilibrium  of  any  particular  process 
and  its  reverse,  in  which  a  molecule  changes  from  one  set  of  congruent 
states  i  to  another  set  /,  by  symbols  of  the  form  Zi_^j  and  Z_j_^_^  we 
may  evidently  write  the  equality 

Z^  i  -^  j  ^^  ^  —  }  ->  —  i  \  OO^  ) 

or  summing  up  for  all  processes  in  which  a  molecule  changes  from  set  / 
to  i,  may  write 

Ni^j  =^Zi^j  =^Z^j^-i=N-j-^-i  (340) 

Similarly  we  shall  have 

NU^-j     =Ni^-i  (341) 

and  if  we  desire  to  group  together  each  state  with  its  reverse,  we  may 
combine  equation  (340)  with  equations  (341),  and  obtain 

N±i^:^j  =:N±j^±i  (342) 

where  the  symbol  N^  i-^±j  denotes  the  total  rate  at  equilibrium  at  which 
molecules  in  the  set  of  congruent  states  i  or  its  reverse  —  i  are  changing 
to  molecules  in  the  set  of  states  /  or  its  reverse  — ;'. 

(b)  For  those  particular  cases  where  each  of  the  states  of  the  mole- 
cule and  its  reverse  are  congruent,  equations  (339)  and  (340)  reduce 
to  the  special  form 

^  t  — » y  —  ^  /  ->  i  (^  «3'to  ) 

and 

Ni^,=Nj^i  (344) 

(c)  li  Zi_^j,  the  rate  at  equilibrium  for  a  particular  process  by  which 
molecules  change  from  the  set  of  congment  states  i  to  the  set  ;',  happens  to 
to  be  equal  to  the  rate  Z_i^_-  for  an  analogous  process  for  the  reverse 
states,  we  shall  then  have  by  the  principle  of  microscopic  reversibility 

•^  i  -» y        ——  ^  —  y  -» —  1 

7  -7  (^45) 

^  —  t  -*  —  y  —  ^  y  -» t 
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which  combined  with  our  special  assumption  of  the  equahty 

Z,-.y=:Z_i^_y  (346) 

will  also  give  us  for  this  special  case  the  result 

Zi^j=Zj^i  (347) 

Such  a  condition  of  affairs  in  which  this  special  result  might  be  expected 
to  hold  would  be  if  the  processes  i  to  /  and  — i  to  —  /  consisted  in 
the  emission  of  radiation,  since  in  accordance  with  the  correspondence 
principle  we  should  then  expect  these  processes  i  to  /  and  —  z  to  - —  /  to 
occur  with  the  same  frequency. 

In    following   chapters    we    shall    find    many   occasions    to    apply    the 
principle  of  microscopic  reversibility  and  its  corollaries. 


Chapter  i6 
The  Absorption  and  Emission  of  Radiation 

205.  Introduction. — In  an  earlier  chapter  we  have  deduced  the 
statistical  equilibrium  between  radiation  and  matter,  by  treating  the  hohl- 
raum  as  containing  modes  of  electromagnetic  vibration  of  different  fre- 
quencies, in  analogy  to  the  modes  of  vibration  of  an  elastic  solid,  and 
then  assigning  to  each  mode  of  vibration  the  energy  that  it  should  have 
on  the  basis  of  the  Maxwell-Boltzmann  distribution  law  as  modified  to 
take  account  of  quantum  phenomena.  In  this  way  we  have  obtained 
a  satisfactory  derivation  of  the  Planck  radiation  law  for  the  distribu- 
tion of  energy  between  the  different  frequencies. 

The  above  method  of  derivation  gave  us,  however,  no  insight  into 
the  mechanism  by  which  the  continual  absorption  and  emission  of  radia- 
tion by  the  walls  of  the  container  actually  does  maintain  the  equilibrium 
distribution  of  energy  among  the  different  frequencies.  In  addition  the 
increasingly  apparent  possibility  that  radiational  energy  is  actually  cor- 
puscular in  nature  makes  the  treatment  of  the  hohlraum  as  a  system  of 
modes  of  electromagnetic  vibration  seem  somewhat  artificial.  For  these 
reasons  a  derivation  of  the  Planck  radiation  law,  by  a  method  which  is 
based  on  considerations  of  the  probability  of  the  absorption  and  emission 
by  atoms  of  radiation  of  different  frequencies,  would  contain  elements 
of  superiority.  The  considerations  of  the  last  chapter  now  make  it  pos- 
sible to  present  such  a  derivation  of  the  Planck  radiation  law,  a  derivation 
first  due  to  Einstein.  Furthermore  the  considerations  now  to  be  pre- 
sented have  the  additional  advantage  of  permitting  a  calculation  of  the 
mean  life  of  molecules  in  different  quantum  states  from  a  knowledge 
of  the  intensities  of  spectral  lines. 

206.  Einstein's  Coefficients  of  Absorption,  Emission  and  In- 
duced Emission, — Consider  an  atom  or  molecule  capable  of  existing  in 
different  quantum  states,  and  consider  two  of  these  states  i  and  j,  of  such 
a  nature  that  the  atom  can  pass  from  i  to  ;  by  the  absorption  of  a  quantum 
of  radiation  of  magnitude  hv,  and  pass  in  the  reverse  direction  by  the 
emission  of  a  quantum  of  the  same  magnitude. 

207.  If  we  should  have  an  atom  of  this  kind  present  in  the  state  / 
in  free  space,  there  would  evidently  be  a  certain  probability  in  unit  time 

1 68 
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of  a  spontaneous  passage  to  the  lower  state  i  by  the  emission  of  a 
quantum  of  radiation.  We  shall  call  this  quantity  the  probability  of 
spontaneous  emission  and  designate  it  by  the  symbol  Aj^i.  It  is  the 
first  of  Einstein's  three  coefficients. 

208.  If,  instead  of  being  in  free  space  our  atom  should  be  surrounded 
by  radiation,  it  is  evident  from  the  correspondence  principle  that  the 
chance  of  emitting  radiation  would  be  different,  since  on  the  basis  of  the 
classical  theory  the  rate  of  energy  emission  of  an  isolated  atom  is  differ- 
ent from  the  rate  of  emission  of  an  atom  whose  moving  electrons  are 
interacting  with  the  varying  electromagnetic  field  present  in  a  radiation 
field.  Hence,  to  obtain  the  total  probability  of  emission  we  must  in- 
clude a  term  depending  on  the  density  of  radiation,  and  shall  take  the 
total  probability  of  emission  in  unit  time  as  given  by  the  expression 
Aj^i  -\-u^B  j^i  where  u^  =  (du/dv)  ^  _  ^^^  is  the  density  of  radiational 
energy  per  unit  frequency  interval  ^  at  the  frequency  vu  involved  in  the 
transition,  and  Bj^i  is  a  second  constant  coefficient  dependent  on  the 
nature  of  the  states  i  and  /.  We  shall  call  this  quantity  5y-»i  the  coeffi- 
cient of  induced  emission. 

209.  Einstein's  procedure  in  placing  the  total  probability  of  emission 
equal  to  the  sum  of  two  terms  of  the  form  given  above  seems  a  natural 
one,  but  of  course  contains  a  certain  element  of  seemingly  arbitrary 
assumption  which  will  only  be  justified  by  the  later  results.  In  think- 
ing about  the  process  of  emission,  it  is  perhaps  unwise  to  consider 
spontaneous  emission  and  induced  emission  as  two  different  kinds  of 
process.  The  quantities  A,  ^  i  and  u^Bj^  i  have  the  same  physical  dimen- 
sions, and  if  we  are  willing  for  the  moment  to  use  the  language  of 
virtual  radiation  fields,  we  may  regard  both  kinds  of  emission  as  induced, — 
Aj^i  being  the  probability  of  induced  emission  due  to  the  virtual  field 
of  the  atom  itself,  and  u^Bj-^i  that  due  to  the  virtual  field  of  the 
surroundings.^ 

It  must  also  be  noted  in  connection  with  the  nature  of  emitted  light 
that  on  the  basis  of  the  correspondence  principle  we  should  expect  radia- 
tion induced  by  an  external  field  to  be  coherent  with  the  radiation  associated 
with  that  field. 

210.  Turning  now  to  the  probability  of  absorption,  it  is  evident  that 
this  will  also  be  dependent  on  the  density  of  radiation  of  the  frequency 
involved,  and  we  shall  take  this  probability  as  equal  to  u^Bi^j  where  the 

*  In  what  follows  we  shall  use  the  term  radiation  density  to  signify  density  of 
energy  per  unit  volume  per  unit  frequency  interval. 

"Note  the  conflicting  view  presented  by  G.  N.  Lewis,  Proc.  Nat.  Acad.  Sci.,  11, 
179  (1925).     Compare  footnote  to  Section  213. 
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quantity  Bi  ->;•  is  a  third  constant  dependent  on  the  nature  of  the  states  i 
and  y  which  we  shall  call  the  coefficient  of  absorption. 

211.  The  Maintenance  of  Equilibrium  by  Absorption  and  Emis- 
sion.— Consider  now  a  system  of  atoms  and  radiation  in  equilibrium  and 
let  Ni  and  Nj,  N.i  and  N.j  be  the  numbers  of  atoms  in  the  system  in 
the  states  i  and  ;  and  their  reverse  states  —  i  and  —  /.  Then  in  accordance 
with  the  principle  of  microscopic  reversibility,  we  may  equate  the  num- 
ber of  changes  per  second  from  state  i  to  /  occurring  through  absorp- 
tion with  the  number  of  changes  occurring  through  emission  between  the 
reverse  states  —  /  and  —  i,  giving  us  the  equality 

NiU,Bi^j=N-j{A-j^-.i+u,B-i-,-i)  (348) 

In  accordance  with  the  principle  of  correspondence,  however,  we  may 
expect  the  emission  of  radiation  by  atoms  in  states  /  and  — /  to  have 
the  same  probability  of  occurrence,^  and  hence  in  agreement  with  Corol- 
lary c  of  the  principle  of  microscopic  reversibility  as  given  in  the  last 
chapter  we  may  write 

NiU^Bi^j  =Nj(Aj^i  -f  w,B/_>0  (349) 

as  one  of  the  conditions  which  must  hold  at  equilibrium. 

If  we  have  a  state  of  equilibrium,  however,  we  must  have  the  addi- 
tional condition  that  the  numbers  of  molecules  in  the  two  states  i  and  ; 
shall  stand  in  the  ratio  prescribed  by  the  Maxwell-Boltzmann  distribu- 
tion law,  giving  us 

^  =  |i.lr  (350) 

N)        Pi 

where  /><  and  />/,  the  a-priori  probabilities  of  the  two  states,  are  unity  in 
case  the  systems  are  non-degenerate,  and  hv  is  evidently  the  proper  value 
to  use  for  the  energy  difference  of  the  two  states. 

Hence,  if  equilibrium  is  to  be  maintained,  the  interaction  of  radia- 
tion and  matter  must  be  such  that  equations  (349)  and  (350)  both  hold. 

212.  Derivation  of  the  Planck  Distribution  Law. — Substituting 
equation  (350)  in  (349)  and  solving  for  the  radiation  density  u^,  we 
easily  obtain 

A  j ^i /B  j ^i    A  j ^ i/B  j ^ i  t"^z^\ 

NjBj^i  pjBj^i 

The  apparently  incongruous  factor  piBi-^  j/pjBj  ^i  can  easily  be  elimi- 

'Note  that  at  equilibrium  A^^^  =  A/".y,  and  with  the  nomenclature  used  we  may 
regard  Uv  as  equal  to  m^^    +  m_,,  where  u^^  =  M_y. 


(353) 
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nated  from  this  equation,  since  for  an  infinite  value  of  the  temperature  T, 
the  value  of  the  radiation  density  u^  must  itself  go  to  infinity,  and  this 
will  only  be  true  if  the  factor  in  question  is  equal  to  unity,  giving  us 
the  equation 

Tir^  =  '  (352) 

so  that  we  may  now  rewrite  (351)  in  the  form 

^^-'  ^Ir  — 1 

Finally  the  equation  can  be  changed  into  the  usual  expression  for 
the  Planck  distribution  law,  if  we  recall  that  in  accordance  with  the 
Wien  displacement  law  u^  must  in  any  case  have  the  form  «^  =  v^<l>{T/v), 
permitting  us  to  write 

^nhv'^         1  r-iCA\ 

«.=-^r-T7 (354) 

ekT  —  1 

where  Snh/c^  may  be  regarded  as  an  experimental  value  of  the  constant 
to  be  introduced,  or  may  be  deduced  from  the  hypothesis  that  the  classical 
Rayleigh-Jeans  formula,  based  on  the  equipartition  of  energy 

u,  =  ^kT  (355) 

must  hold  when  the  temperature  T  is  high  and  the  frequency  v  small. 

213.  Relation  between  Einstein's  Coefficients. — The  foregoing 
treatment  provides  us  not  only  with  a  derivation  of  the  Planck  distribu- 
tion law,  but  also  with  a  set  of  relations  between  the  important  quantities 
which  we  have  called  Einstein's  coefficients  of  absorption  and  emission. 

In  accordance  with  equation  (352),  we  may  write 

Bi-.i=^B,^j  (356) 

Pi 

and  combining  with  equations    (353)    and    (354)    obtain 

*  Substituting  equation    (357)    in  our  previous  expression   given  in   Section  208 
for  the  total  probability  of   emission,   we  obtain   for  this   quantity  the  expression 


( — 5 —  -{- Uvj  Bj-^i.    Noting  that     — j —     has  the  same  physical  dimensions  as  u 

we  see  that  by  regarding    — -^ —    as  a  virtual  radiation  density 
we  can  consider  all  emission  as  induced,  as  already  remarked. 
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We  thus  see  that  the  three  coefficients  are  so  related  that  the  value 
of  any  two  can  be  calculated  from  the  third.  This  is  of  importance  in 
considering  the  intensities  of  absorption  and  emission  lines. 

214.  Interpretation  of  the  Width  of  Spectral  Lines. — When  we 
come  to  the  application  of  the  foregoing  considerations  to  the  interpreta- 
tion of  the  intensities  of  absorption  or  emission  of  spectral  lines,  we  are 
immediately  met  by  the  experimental  fact  that  spectral  lines  actually 
have  in  general  a  small  but  definite  width,  the  intensity  usually  falling 
off  rapidly  on  both  sides  of  an  approximately  central  maximum. 

The  quantum  theory  explanation  for  this  fact  lies  in  the  hypothesis 
that,  in  an  actual  gas,  the  molecules  in  a  given  quantum  state  (i.e.  with 
given  values  of  the  quantum  numbers)  do  not  all  have  exactly  the  same 
energy  content.  This  arises  from  the  fact  that  molecules  in  a  given 
quantum  state  have  different  velocities,  and  are  differently  affected  by 
the  stray  fields  from  neighboring  molecules.  Hence  when  a  transition 
takes  place  from  one  given  set  of  quantum  numbers  to  another,  the  energy 
emitted  or  absorbed  may  not  be  exactly  the  same  as  it  would  be  if  the 
molecules  were  stationary  and  in  free  space,  and  hence  the  magnitude 
of  the  quantum  hv  will  also  vary  slightly  for  a  given  change  in  quantum 
numbers.  We  have  indeed  experimental  evidence  that  such  causes  would 
affect  the  frequency  of  the  radiation,  since  the  quantum  theory  inter- 
pretation of  the  Doppler  effect  shows  the  influence  of  kinetic  energy  on 
the  frequency  of  spectral  lines  and  the  experimental  results  and  theo- 
retical considerations  of  the  Stark  effect  illustrate  the  influence  that  is 
to  be  expected  from  stray  electric  fields. 

In  addition  to  these  known  causes  for  the  broadening  of  spectral  lines, 
it  is  also  possible  that  there  is  an  inherent  weakness  in  the  quantization 
of  molecular  states,  and  consequent  lack  of  sharpness  in  the  energy  levels, 
which  would  give  some  width  to  the  spectral  lines  even  from  a  set  of 
stationary  molecules  in  free  space.  At  the  present  time,  however,  we  have 
no  direct  experimental  evidence  as  to  what  this  "natural"  width  of  spec- 
tral lines  would  be,  if  it  really  exists.  In  the  cases  of  usual  interest, 
however,  it  would  probably  be  small  compared  with  the  moderate  widen- 
ing produced  by  the  Doppler  effect  and  the  usually  much  larger  widening 
produced  by  the  fields  from  neighboring  molecules. 

It  is  evident  that  these  factors  operative  in  disturbing  the  energy  levels 
of  molecules  and  thus  affecting  the  width  of  spectral  lines  might  also 
have  an  effect  on  the  probability  of  transition  from  one  level  to  another. 
Thus  the  probability  that  an  isolated  molecule  surrounded  by  radiation 
of  unit  density  at  the  frequency  necessary  for  interaction  would  absorb 
a  quantum  of  radiation  might  be  different  from  the  probability  of  absorp- 
tion for  a  molecule  also  under  the  influence  of  radiation  of  the  right  fre- 
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quency  and  unit  density  but  affected  by  the  fields  from  neighboring 
molecules.  We  have  indeed  some  experimental  evidence  that  this 
might  be  a  possibility,  since  the  experiments  of  Fiichtbauer,  Joos  and 
Dinkelacker  ^  show  that  in  the  case  of  a  given  amount  of  mercury  vapor 
in  the  normal  1^  state,  the  total  number  of  quanta  absorbed  by  transi- 
tion to  the  2p2  state  (A  2537),  drops  by  about  50  per  cent  when  nitrogen 
at  a  pressure  of  50  atmospheres  is  added  to  the  previously  dilute  mercury 
vapor,  the  intensity  of  light  being  kept  the  same.''  Hence,  since  the  total 
intensity  of  a  line  is  affected  by  the  presence  of  neighboring  molecules, 
it  seems  possible  that  the  relative  intensities  within  the  width  of  an 
absorption  line  might  also  be  due,  not  only  to  the  relative  numbers  of 
molecules  having  the  requisite  energy  levels  for  the  frequency  in  ques- 
tion, but  to  a  variation  in  the  probability  of  absorption  as  we  pass  across 
the  line.  We  have,  however,  no  direct  experimental  evidence  on  this 
point,  and  consideration  of  the  question  makes  it  seem  probable  that 
such  a  possible  variation  in  probability  of  absorption  as  we  pass  across  a 
line  is  of  secondary  importance  in  accounting  for  the  intensity  distribu- 
tion within  the  line  as  compared  with  the  variation  in  numbers  of  mole- 
cules having  the  requisite  energy  levels.  Hence,  in  the  interests  of 
simplicity,  at  the  present  stage  of  experimental  and  theoretical  develop- 
ment, we  shall  feel  justified  in  treating  Einstein's  coefficients  Aj-^t, 
Bj  -*  i,  and  Bi^  j  as  constant  quantities  applying  at  any  point  within  a 
spectral  line.^ 

215.  Relation  between  the  Probability  of  Absorption  Bi^j  and 
the  Ordinary  Coefficient  of  Absorption.^ — We  are  now  ready  to  in- 
terpret the  intensity  of  absorption  lines  with  the  help  of  the  foregoing 
considerations,  and  indeed  to  use  the  experimental  measurements  of  in- 
tensities for  calculating  the  values  of  Einstein's  coefficients. 

Consider  an  ordinary  experiment  on  the  intensity  of  absorption  lines, 
in  which  a  beam  of  light  of  total  intensity  /  as  measured  in  ergs  per 
square  centimeter  per  second  falls  on  a  layer  of  the  absorbing  medium 
of  thickness  dx  and  let  us  define  the  intensity  of  light  /^  for  a  given 
frequency  by  the  expression 

1,=^  (358) 

Since  the  energy  is  travelling  with  the  velocity  c  it  is  evident  that  the 
radiation  density  ii„  will  be  related  to  the  intensity  /^  by  the  equation 

°  Fuchtbauer,  Joos  and  Dinkelacker,  Ann.  der  Physik,  71,  204  (1923). 

"More  familiar  deviations  from  Beer's  law  of  absorption  should  also  be  noted. 

'  For  treatments  where  this  constancy  is  not  assumed,  see  Pauli,  Zeitschr.  filr 
Phys  18,  272  (1923)  ;  Einstein  and  Ehrenfest,  Zeitschr.  fur  Phys.,  19,  301  (1923)  ; 
Tolman  and  Badger,  Phys.  Rev.,  27,  383  (1926). 
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u,  =  -  (359) 

Let  us  now  consider  the  diminution  in  the  intensity  of  this  Hght,  on 
passage  through  the  layer  of  thickness  dx,  due  to  the  absorption  of  hght 
by  molecules  passing  from  a  lower  state  i  to  a  higher  state  /, 

If  Ni  is  the  total  number  of  molecules  per  cubic  centimeter  in  the 
lower  quantum  state,  then  the  number  in  the  layer  of  unit  cross  section 
and  thickness  dx  will  be  Ni  dx.  In  accordance  with  our  discussion  of  the 
width  of  spectral  lines,  however,  we  shall  only  be  interested  for  the 
moment  in  those  molecules  which  are  not  only  in  the  lower  quantum  state, 
but  are  so  affected  by  their  velocity  and  by  the  fields  from  neighboring 
molecules  as  to  be  capable  of  picking  up  energy  in  the  range  y  to  j'  -]-  dv, 

and   can  take   for   this   number   of   molecules   the   expression  -^ —  dvdx. 

Hence  the  rate  at   which   energy   in  the   frequency   range  v  to  v  -{-  dv 
is  being  removed  from  the  beam  in  the  layer  dx  will  evidently  be 


hv 


(?^  dvdx\  u^Bi^j  (360) 

On  the  other  hand,  by  induced  emission,  radiation  coherent  with  the 
oncoming  beam  will  evidently  be  returned  at  the  rate 

hv  (-^  dvdxj  u^B  j-^i  (361) 

The  difference  of  these  two  terms  for  energy  removed  and  returned 
per  second  will  evidently  give  the  decrease  in  the  energy  passing  in  one 
second  through  the  layer,  so  that  combining  (360)  and  (361)  and  sub- 
stituting (359),  we  can  write 

_a(IM)  =^  l(^^  B  i^,  -^  B  j^i^jSvdx         (362) 

or  transposing  and  eliminating, 

This  logarithmic  decrement  of  the  intensity  with  distance  is,  however, 
the  quantity  usually  defined  as  the  absorption  coefficient  a^  for  light  of 
a  given  frequency,  and  is  the  quantity  ordinarily  determined  in  absorp- 
tion experiments.     So  that  we  now  have  the  useful  equation 

hv/dNi^  dNij.        \  ,,,., 
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connecting  Einstein's  coefficients  with  the  experimentally  measured 
quantity  a, . 

216.  Calculation  of  Einstein's  Coefficients  from  Absorption 
Measurements. — We  shall  now  be  able  to  calculate  the  value  of  Ein- 
stein's coefficients  from  intensity  measurements.  If  we  plot  the  measured 
values  of  a^  as  a  function  of  v  across  the  whole  width  of  an  absorption 
line,  and  integrate  under  the  curve,  we  evidently  obtain  in  accordance  with 
equation   (364) 

r   ^       rhv  /dNi  „  dNj  ^      \  ,  ,,^„, 

jaJv=j-i^-^B.^,—^B,^jdr  (36.) 

Since  in  most  cases  v  varies  but  slightly  in  crossing  the  effective  width 
of  the  line,  and  since  in  accordance  with  Section  214  we  shall  take  B.i-^j 
and  Bj-^i  as  constants,  we  can  rewrite  this  in  the  form 

^^-^[B,^,Ni  —  B,^iNj]  (366) 

Since  in  general  the  intensity  of  the  light  used  in  absorption  experi- 
ments is  not  enough  to  cause  any  appreciable  change  in  the  number  of 
molecules  in  different  quantum  states  from  that  existing  at  equilibrium, 
we  can  substitute  for  the  ratio  of  Ni  to  A'^^  that  given  by  the  Maxwell- 
Boltzmann  distribution  law  as  expressed  by  equation  (350).  Further- 
more, we  can  substitute  for  Bj-*i  in  terms  of  Bi-^j  in  accordance  with 
equation  (356).  Doing  this  and  solving  for  the  thing  of  interest,  the 
coefficient  of  absorption  Bi^j,  we  obtain 


B 


hvNi  (  1  —  e     it) 


fa,  dv  (367) 


At  ordinary  temperatures,  even  at  frequencies  as  low  as  in  the  near  infra 
red,  the  value  of  v  over  T  is  large  enough,  so  that  this  equation  can  often 
be  simplified  to  the  form 

Bi^j  =  -J^J  ^"^'^  (268) 

The  possibility  thus  provided  of  calculating  the  experimental  value 
of  Einstein's  coefficient  Bi^j  from  measurements  of  the  intensity  of 
absorption  lines  is  of  great  value  in  the  interpretation  of  atomic  behavior. 
Knowing  the  value  of  Bi-*,,  the  other  two  coefficients  Bj  -*  i  and  Aj^i 
can  of  course  at  once  be  calculated  from  equations  (356)  and  (357). 
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217.  Mean  Life  of  Molecules  in  Activated  States. — Since  Aj  -» i 
is  the  specific  rate  for  the  spontaneous  transition  of  molecules  by  emission 
of  radiation  from  state  /  to  state  i,  it  is  of  course  easy  to  determine  from 
this  quantity  the  mean  duration  of  molecules  in  state  /  before  jumping 
to  state  i.  For  the  number  of  molecules  leaving  state  ;  in  any  time 
interval  dt  we  have 

—  dNj  =  NjAj^idt  (369) 

and  integrating  obtain  for  the  number  present  at  any  time  t 

Nj=(Nj)oe-''^--^''  (370) 

where  (Nj)o  is  the  number  present  at  the  time  zero.  Hence  for  the  num- 
ber having  a  life  between  t  and  t-}-dt  we  may  write  {N j) Qe~ ^"^ i -^ i*  A  j^idt 
and  for  the  mean  life  x  obtain 

00 

T  =  j^j{N,),e-''i^^'  Aj^itdt  =  -^^  (371) 

0 

The  reciprocal  of  Einstein's  coefficient  Aj  _^i  is  then  the  mean  life 
of  the  molecule  in  state  ;",  provided,  however,  of  course  that  this  transi- 
tion from  state  ;  to  state  i  is  the  only  one  that  can  occur.  If  the  mole- 
cule can  leave  state  ;  by  a  number  of  different  processes,  perhaps  of  a 
collisional  as  well  as  a  radiational  nature,  then  the  actual  mean  life  will 
be  correspondingly  shortened.  For  example,  if  there  are  a  series  of 
lower  states  i  to  which  a  spontaneous  transition  can  take  place  we  shall 
have 

'  =  1:57:-  <^^2> 

i 

218.    Examples  of  the   Calculation  of   Einstein's   Coefficients. — 

With  the  help  of  equations  (367)  and  (368),  values  of  Einstein's  co- 
efficients have  been  calculated  for  a  considerable  number  of  atomic  and 
molecular  transitions.^ 

As  a  good  example  of  the  method  we  may  consider  the  calculation 
of  the  value  of  Bi^j  for  the  transition  of  mercury  from  the  normal  IS 
to  the  2p2  state,  corresponding  to  the  line  I  2537,  since  the  absorption 
data  of  Fiichtbauer,  Joos  and  Dinkelacker  ^  on  this  line  are  the  best 
that  we  now  have  available.  The  frequency  of  this  line  is  high  enough 
compared   with   the   temperature   at   which   the   experiments   were    made 

"See  Tolman,  Phys.  Rev..  23,  693  (1924);  Tolman  and  Badger,  ibid.,  27,  383 
(1926). 

'Fiichtbauer,  Joos  and  Dinkelacker,  Ann.  der  Physik.,  71,  204  (1923), 
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so  that  we  may  neglect  the  second  term  in  the  denominator  of  equation 
(367)  and  use  equation  (368) 


^'^'  -JivN 


H^J^^'^  (368) 


for  our  calculations.  For  v  we  take  the  frequency  corresponding  to 
2537  A.  For  Ni  we  may  take  the  total  number  of  atoms  of  mercury 
per  cubic  centimeter,  since  the  first  energy  level  of  mercury  above  the 
normal  IS  state  is  so  high  that  there  are  practically  no  atoms  in  any 
but  the  normal  state  at  the  temperatures  employed.  In  the  experiments 
of  Fiichtbauer,  Joos  and  Dinkelacker  this  number  could  be  calculated 
from  the  known  vapor  pressure  of  mercury,  since  liquid  mercury  at  a 

definite  temperature  was  present  in  the  apparatus.    The  value  of    I   a^dv 

was  determined  by  these  authors  by  graphical  integrations  under  the 
absorption  curves  obtained  in  runs  in  which  the  width  of  the  line  was 
artificially  increased  by  the  addition  of  hydrogen,  nitrogen,  argon,  or 
carbon  dioxide  at  various  pressures.  The  final  value  was  determined  by 
extrapolation  to  zero  pressure,  and  is  quite  possibly  correct  within  ten  per 
cent. 

Substituting  the  values  thus  obtained  into  the  equation,  we  obtain 
for  Bi_^j  the  value  2.88  X  10^^  everything  being  expressed  in  c.g.s. 
units.  Taking  the  ratio  of  pi  to  />/  as  1 :  3  ^"  we  can  then  obtain  from 
equation  (357)  for  the  rate  of  spontaneous  decay  of  mercury  in  the  2p2 
state,  Aj  _^i,  the  value  9.68  X  10^  From  this  the  mean  life  t  of  mercury 
atoms  in  the  2p2  state  comes  out,  in  accordance  with  equation  (371) 
1.03  X  10-^  seconds. 

219.  This  latter  value  has  special  interest  as  it  has  been  checked,  since 
its  first  calculation,^^  by  two  independent  methods. 

Thus  Turner  ^^  has  calculated  the  mean  life  t  from  measurements  of 
Cario  and  Franck,  on  the  rate  of  production  of  activated  hydrogen  by 
collisions  with  mercury  vapor  which  is  activated  by  k  2537.  It  is  evi- 
dent that  this  rate  will  be  dependent  on  the  mean  life  t  for  the  excited 
atoms,  since  in  order  to  activate  hydrogen  they  must  collide  with  a 
hydrogen  molecule  before  losing  their  excitation  by  spontaneous  emis- 
sion. By  comparing  the  rates  of  reaction  at  dififerent  pressure  of  hydro- 
gen Turner  was  thus  able  to  determine  that  the  mean  life  x  must  be  in 
the  neighborhood  of  lO"'^  seconds. 

"According  to  Pauli,  Zeitschrift  fiir  Physik,  31,  765  (1925),  the  number  of 
stationary  states  corresponding  to  the  quantum  number  ;  which  gives  the  total 
angular  momentum  of  the  atom  is  2;  -|-  1.  The  IS  level  has  the  value  ;  =  0  and 
the  2p2  level  the  value  /  =  1. 

"  Communicated  to  the  National  Academy,  Jan.  17,  1924. 

"Turner,  Phys.  Rcr.,  23,  464  (1924). 
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Determinations  of  the  mean  life  of  atoms  in  excited  states  can  also 
be  obtained  by  observing  the  rate  of  decay  in  the  intensity  of  the  Hght 
emitted  by  canal  rays,  as  the  excited  atoms  causing  these  rays  travel 
back  of  the  slit  through  the  cathode.  Measurements  of  this  kind  were 
first  made  by  Wien  and  gave  indeed  the  earliest  determinations  that  we 
have  for  values  of  t.  In  the  case  of  mercury  excited  to  the  2p2  state, 
Wien  ^^  obtains  in  this  manner  the  value  x  =  0.98  X  10"^  seconds  in 
complete  agreement  with  that  calculated  above. 

220.  As  another  example  of  the  calculation  of  Einstein's  coefficient, 
we  may  consider  the  transition  of  hydrogen  chloride  molecules  from  one 
rotational  state  to  another.  These  transitions  give  rise  to  lines  in  the 
far  infra  red,  the  absorption  for  which  have  recently  been  measured 
by  Czerny."  A  high  degree  of  accuracy  can  not  be  claimed  for  the 
results,  owing  to  the  extreme  difficulty  of  working  in  the  far  infra  red. 
The  values  of  Einstein's  coefficients  that  can  be  calculated,  however, 
are  of  special  interest  because  of  the  simplicity  of  the  model,  that  of  a 
rigid  dipole  rotator,  which  can  be  used  for  their  interpretation. 

For  these  calculations  the  energy  levels  involved  are  both  so  low  that 
there  are  an  appreciable  number  of  molecules  in  both  states  i  and  ;',  and 
the  reinforcement  of  the  primary  beam  in  the  absorption  measurements 
by  induced  emission  is  quite  appreciable.  Hence  equation  (367)  in  its 
complete  form 

Bi-.,= -jT^ ^r«.^v  (367) 

hvNi\l  —  e~Fry 

must  be  used. 

Czerny's  data  permits  the  application  of  this  equation  to  lines  Nos.  8, 
9,  and  10  in  the  pure  rotational  spectrum  of  hydrogen  chloride  in  which 
the  absorption  induces  transitions  such  that  the  rotational  quantum  num- 
ber m  changes  from  7.5  to  8.5,  8.5  to  9.5,  and  9.5  to  10.5.  To  determine 
the  number  of  molecules  Ni  in  the  lower  quantum  state  for  the  differ- 
ent lines,  we  may  take  the  total  concentration  of  the  hydrogen  chloride 
vapor  and  then  calculate  with  the  help  of  the  Maxwell-Boltzmann  dis- 
tribution law  as  given  by  equation   (186)    in  Chapter  8  the  number  in 

the  desired  level.     The  value  of  the  integral    I   a^dv  can  be  determined 

graphically  from  the  data  presented  in  Czerny's  curves  giving  the  per- 
centage transmission  for  a  tube  of  given  length  as  a  function  of  wave 
length.  Substituting  into  equation  (367),  we  obtain  for  the  three  transi- 
tions in  question  the  values  of  i?,-^y  5.6  X  10'^  6.8  X  10^''  and  10.6  X  10'* 

"'W.  Wien,  Ann.  dcr  Phvsik,  73,  483   (1924). 
"Czerny,  Zeitschrift  fur  Physik,  34,  227  (1925). 
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everything  being  expressed  in  c.g.s.  units.  These  values  are  subject  to 
considerable  uncertainty  owing  to  the  difficulties  involved  in  Czerny's 
work,  and  to  a  slight  degree  owing  to  doubt  as  to  the  correct  a-priori 
probabilities  to  use  in  calculating  the  values  of  Ni. 

The  above  values  are  of  special  interest,  since  the  simple  dipole  rotator 
which  can  be  taken  as  the  model  involved  has  made  it  possible  to  cal 
culate  theoretically  for  the  first  time  the  expected  absolute  values  B^_^, 
from  the  correspondence  principle.  This  has  been  done  ^^  and  the  theo- 
retical values  of  Bi^j  11.9  X  10^°,  11.8  X  10^«  and  11.6  X  10'"  obtained 
for  the  transitions  in  question.  These  results  are  also  subject  to  some 
uncertainty  owing  to  doubt  as  to  the  correct  a-priori  probabilities  and 
as  to  the  proper  value  to  use  for  the  electric  moment  of  the  dipole.  The 
approximate  check,  however,  should  be  noted. 

It  is  also  interesting  to  calculate,  from  the  above  values  of  B^^j,  the 
values  of  ^,-_»,-  for  spontaneous  transitions  in  the  opposite  direction,  and 
the  values  for  the  mean  life  of  the  molecules  in  the  upper  quantum 
states.  We  find,  what  at  first  sight  must  seem  the  surprising  result,  that 
the  mean  life  of  the  atoms  in  the  upper  states  involved  is  of  the  order 
of  seconds.  Thus  the  mean  life  of  hydrogen  chloride  in  the  quantum 
state  m  =  9.5  comes  out  27  seconds  as  calculated  from  the  experimental 
v^lue  of  Bi^j-  given  above.  This  means  of  course  that  transitions  be- 
tw^en-Jijedififerent  rotational  states  presumably  occur  in  the  gas  under 
ordinary  con^iHonsTmlclrTnore  frequently  through  collision  than  through 
interaction  with  radiation.  There  is  no  theoretical  reason  why  such  a 
condition  of  affairs  should  not  be  the  case,  since  for  example  in  the  case 
of  the  hydrogen  molecule  we  know  from  specific  heat  data  that  rotational 
states  do  exist,  but  from  the  homopolar  nature  of  the  molecule  we  are 
certain  that  transitions  between  these  states  by  interaction  with  radiation 
will  almost  never  take  place. 

221.    Range  of  Values  for  Einstein's  Coefficient  of  Absorption 

JBj_^y. — A  knowledge  of  the  probable  range  of  values  for  the  coefficient 
of  absorption  Bi_^j  will  be  of  value  for  our  later  interpretations  of  the 
rate  of  chemical  reactions,  since  it  will  permit  us  to  estimate  the  prob- 
ability that  chemical  activation  occurs  by  the  absorption  of  radiation. 

By  the  methods  illustrated  above,  experimental  values  of  Bi-^i  have 
been  calculated  for  a  considerable  number  of  transitions  and  found  to 
range  all  the  way  from  10^"*  to  2.3  X  10^^  c.g.s.  units.  In  addition  it  is 
possible  theoretically  to  estimate  a  probable  upper  limit  for  values  of 
Bi^j  with  the  help  of  the  correspondence  principle. 

Consider  the  transition  between  an  upper  quantum  state  /  and  a  lower 
state  i.     If  we  have  Nj  molecules  in  the  upper  state,  then   the  rate  of 

^'Phys.  Rev.,  27,  383  (1926). 
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spontaneous  emission  of  energy  due  to  the  transition  in  question  will  have 
on  the  basis  of  the  quantum  theory  the  value 

^  =  NjA,-^ihv  (373) 

On  the  other  hand,  on  the  basis  of  the  classical  theory  the  rate  of  spon- 
taneous emission  of  Nj  oscillators  having  the  frequency  v  would  be 

■dr=^'(T?r-jO--'  (374) 

where  e  is  the  charge  of  the  oscillating  particle  and  Q  the  amplitude. 

On  the  basis  of  the  correspondence  principle  we  should  expect  an 
approximate  equality  between  these  two  expressions  provided  we  take  Q 
as  the  amplitude  of  that  particular  harmonic  in  the  internal  motion  of 
the  molecule  whose  order  numbers  Ti  .  .  .  .  Ts  are  the  same  as  the  changes 
in  quantum  number  An^ .  .  .  .  Aug  which  characterize  the  quantum  transi- 
tion.    Equating  the  two  expressions  and  solving  for  Aj-^i  we  obtain 

or  introducing  in  accordance  with  equation    (357)    the   relation 

Aj-.i=—^Bi^j  (376) 

where  for  simplicity  we  have  considered  the  system  as  non-degenerate  so 
that  the  a-priori  probabilities  are  unity,  we  can  obtain 

B.^j=^Q^  (377) 

The  correct  manner  of  calculating  0  has  not  been  completely  specified, 
nevertheless  we  may  expect  that  the  upper  limit  for  this  quantity  will 
be  of  the  order  of  molecular  dimensions.  If  then  we  take  Q  as  3  X  10"® 
cm.  and  e  as  the  ch?rge  of  the  electron  we  obtain 

Bi^i  =  10-°  c.g.s.  units  (378) 

and  should  not  expect  to  encounter  values  of  this  quantity  greatly  in 
excess  of  this  amount.  We  shall  find  a  later  application  for  this  im- 
portant though  approximate  conclusion. 


Chapter  17 
Collisions  of  the  First  and  Second  Kinds 

222.  Introduction. — The  principle  of  microscopic  reversibility,  dis- 
cussed in  Chapter  15,  makes  it  possible  to  draw  some  interesting  and 
important  conclusions  as  to  the  probability  of  collisions  in  which  mole- 
cules are  transferred  from  one  quantum  state  to  another.  These  con- 
clusions were  first  obtained  by  Klein  and  Rosseland  ^  by  a  method  of 
reasoning  similar  to  but  not  quite  identical  with  that  which  we  shall 
present. 

In  connection  with  the  phenomena  to  be  discussed  it  has  become  cus- 
tomary to  employ  the  terms  elastic  collisions,  and  non-elastic  collisions 
of  the  first  and  second  kinds.  By  an  elastic  collision  we  shall  mean  one 
in  which  the  total  kinetic  energy  of  the  two  colliding  molecules  is  not 
changed  as  a  result  of  the  impact.  By  a  collision  of  the  first  kind  we 
shall  understand  one  in  which  all  or  a  portion  of  the  relative  kinetic 
energy  of  the  molecules  before  their  impact  is  used  in  raising  one  of 
the  molecules  from  a  lower  to  a  higher  quantum  state.  Such  collisions 
are  thus  somewhat  similar  to  the  inelastic  collisions  of  everyday  me- 
chanics. By  a  collision  of  the  second  kind  we  shall  understand  one  in 
which  one  of  the  molecules  is  at  the  start  in  an  activated  or  higher 
quantum  state  and  falls  to  a  lower  state  as  a  result  of  the  impact,  trans- 
ferring all  or  a  portion  of  the  energy  of  activation  into  the  kinetic  form. 
Such  collisions  are  hence  "superelastic," 

223.  The  simplest  examples  of  these  three  kinds  of  collisions  are  fur- 
nished by  the  interaction  of  atoms  with  moving  electrons. 

If  we  have  a  tube  containing  a  dilute  gas,  for  example  mercuiy  vapor, 
and  provided  with  a  source  of  electrons  such  as  a  hot  filament,  it  is 
well  known  that  for  low  velocities  of  the  electrons  only  elastic  collisions 
will  be  made  with  the  mercury  atoms,  the  kinetic  energy  of  the  electrons 
being  practically  unchanged  by  the  impacts,  since  the  mercury  atoms  are 
so  heavy  compared  with  the  electrons  as  to  suffer  only  a  negligible  change 
in  kinetic  energy.^ 

^  Klein  and  Rosseland,  Zeitschrift  fur  Physik,  4,  46  (1921).  See  also  Fowler, 
Phil.  Mag..  47,  257  (1924). 

"In  the  case  of  collisions  between  electrons  and  lighter  atoms  such  as  helium 
it  sometimes  becomes  necessary  to  consider  the  small  kinetic  energy  transferred 
to  the  helium  atom  even  in  an  elastic  collision. 

i8l 


182  STATISTICAL  MECHANICS  ■  [Ch.  17 

By  accelerating  the  electrons,  however,  through  a  proper  potential 
drop  they  can  be  given  sufficient  kinetic  energy  so  that  on  impact  with 
the  mercury  atoms  they  can  transfer  the  necessary  energy  to  the  mercury 
atom  to  raise  the  latter  to  a  higher  quantum  state,  and  collisions  of  the 
first  kind  will  then  in  general  take  place.  In  the  case  of  mercury,  these 
occur  when  the  electrons  have  received  the  kinetic  energy  correspond- 
ing to  a  potential  drop  of  about  4.7  volts  which  is  sufficient  to  knock 
the  atom  from  the  normal  16^  to  the  2pz  state,  which  is  the  next  lowest 
quantum  state  for  this  atom.  Processes  of  this  nature  are  the  simplest 
examples  of  collisions  of  the  first  kind.  They  provide  as  a  matter  of 
fact  the  most  unequivocal  evidence  we  have  as  to  the  real  existence  of 
quantized  energy  levels  in  atoms. 

The  existence  of  these  collisions  of  the  first  kind  between  electrons 
and  atoms,  however,  also  necessitates  the  existence  of  collisions  of  the 
second  kind,  since  in  accordance  with  the  principle  of  microscopic  re- 
versibility we  must  expect  the  possibility  of  a  reversal  of  the  foregoing 
process  of  a  nature  such  that  excited  atoms,  coming  in  contact  with  slow 
moving  electrons,  drop  back  to  a  lower  quantum  state  and  give  their 
energy  of  excitation  up  to  the  electrons  in  the  form  of  increased  velocity. 

224.  Collisions  of  these  three  kinds  also  undoubtedly  occur  between 
pairs  of  atoms  or  molecules  as  well  as  between  an  atom  and  an  electron. 
Most  of  the  collisions  between  atoms  and  molecules  are  presumably  elastic, 
not  unlike  those  usually  considered  in  the  older  kinetic  theory,  but  very 
clear  cut  examples  of  collisions  of  the  second  kind  between  atoms  have 
been  found ;  and  their  existence  evidently  guarantees,  by  the  principle  of 
microscopic  reversibility,  the  occurrence  of  collisions  of  the  first  kind.  The 
unequivocal  demonstration  of  collisions  of  the  second  kind  between  atoms 
was  first  obtained  by  Carlo, ^  who  showed  that  mercury  atoms  raised  to 
the  2p2  state  by  the  absorption  of  /I  2537  could  transfer  a  portion  of  their 
energy  of  excitation  to  thallium  atoms  putting  the  latter  into  a  condition 
to  emit  lines  of  the  thallium  spectrum,  the  excess  energy  of  the  mercury 
atom  being  transferred  into  the  kinetic  form.  The  work  of  Carlo  also 
actually  gave  some  direct  evidence  of  the  existence  of  collisions  of  the 
first  kind  between  mercury  and  thallium  atoms,  since  in  some  cases  the 
thallium  atoms  were  raised  to  quantum  states  of  slightly  higher  energy 
than  that  of  the  excited  mercury  atom.  This  means  of  course  that 
the  kinetic  energy  of  the  atoms  had  to  be  called  upon  to  supply  some  of 
the  energy. 

225.  Probability  of  a  Collision  of  the  First  Kind  between  an 
Atom  and  a  Fast  Electron. — We  are  now  ready  to  deduce  the  Klein 

'Cario,  Zeitschr.  fur  Phys.,  10,  185   (1922). 
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and  Rosseland  relation  between  the  probabilities  for  the  occurrence  of 
collisions  of  the  first  and  second  kinds  between  atoms  and  electrons. 
Consider  a  gaseous  system  of  unit  volume  containing  under  equilibrium 
conditions  A^e  electrons  and  Ni  atoms  in  a  lower  quantum  state  i,  which 
can  be  raised  to  a  higher  quantum  state  /  by  collisions  of  the  first  kind 
with  the  electrons.  Since  the  atoms  are  very  heavy  compared  with  the 
electrons,  we  can  regard  them  as  fixed  and  consider  only  the  velocities 
of  the  electrons  as  determining  the  chance  of  collision  and  the  amount 
of  energy  available. 

In  order  for  an  electron  to  be  able  to  produce  a  collision  of  the  first 
kind,  it  must  have  at  least  the  kinetic  energy  Sij  which  is  necessary  to 
raise  the  atom  from  the  state  i  to  the  state  /,  and  the  probability  of  a 
successful  collision  will  evidently  be  some  function  of  the  excess  energy 
which  it  has  over  this  amount.  Let  us  then  consider  the  probability 
of  collisions  of  the  first  kind  by  electrons  lying  in  the  energy  range  from 
(eij -\- e)  to  (sij  -\-  s  -\-  de).  The  number  of  such  electrons  present  in 
the  system,  from  Maxwell's  distribution  law  (see  equation  (94)  Chapter 
5),  will  evidently  be 

ft)  +  e       

dNe  =  Neae        kT    Vstj  -f  eds  (379) 

where  a  is  a  parameter  depending  on  the  temperature.  The  chance  that 
one  of  these  electrons  will  make  any  collision  with  an  atom  in  the  state  i 
will  evidently  be  proportional  to  the  number  of  electrons  and  atoms,  to 
the  cross  section  of  the  atoms,  and  to  the  velocity  \/2{sij  +  s)/m  with 
which  the  electrons  are  moving.  Hence  for  the  number  of  collisions  of 
the  first  kind  occurring  per  second  in  which  these  electrons  induce  the 
change  from  the  state  i  to  ;',  we  may  evidently  write 

e<i  +  e     [2 

Z^uj  =  P^ujN^iNeac       kT    ^—(eij-\-e)de  (380) 

J  in 

where  the  probability  factor  P+i+j  may  be  thought  of  as  the  product  of 
the  cross  section  of  the  atom  multiplied  by  the  chance  that  a  collision 
will  be  effective.  This  quantity  P+i+j  is  evidently  a  function  of  the 
energy  of  the  electron  (eij-\-e).  (The  plus  signs  in  the  subscripts 
have  been  introduced  since  we  shall  desire  to  distinguish  between  the 
states  i  and  ;  of  an  atom  and  the  reverse  states  — i  and  — /.) 

226,  Probability  of  a  Collision  of  the  Second  Kind  between  an 
Atom  and  a  Slow  Electron. — Let  us  now  consider  the  reverse  of  the 
foregoing  process,  such  that  an  electron  with  kinetic  energy  in  the  range  e 
to  e  -\-  de  collides  with  an  excited  atom  in  the  reverse  state,  — ;',  and 
through  a  collision  of  the  second  kind  receives  the  energy  eij,  the  atom 
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falling  to  the  reverse  of  the  previous  original  state,  — i.  Evidently  for 
the  number  of  such  processes  taking  place  in  unit  time  we  can  write  in 
analogy  to  equation   (380)   the  expression 

Z_.  i  =  P^j^i  N.jNeac~Tf\-  eds  (381) 

I  m 

where  P.j.i  is  the  probability  factor  for  the  reverse  change. 

227.  Relation  between  the  Two  Probabilities. — In  accordance  now 
with  the  principle  of  microscopic  reversibility  these  two  rates,  which  are 
for  processes  that  are  the  reverse  of  each  other,  must  be  equal,  so  that 
by  combining  equations  (380)  and  (381)  and  cancelling  common  fac- 
tors, we  can  obtain 

lii  ( 382 ) 

P^ui  N^iisij  -\-s)  =  P_i_i  N.jS  ekT  ^       ^ 

This  equation  connecting  the  probabilities  of  the  two  kinds  of  collision 
will  be  unfamiliar,  since  it  connects  the  probability  of  transition  from 
state  i  to  ;',  not  as  is  usual  with  the  probability  of  transition  from  state  / 
to  i,  but  with  the  probability  of  transition  from  the  reverse  state  ■ — / 
to  the  reverse  state  —  i.  The  equation  is,  however,  the  most  immediate 
conclusion  that  we  can  draw  from  the  principle  of  microscopic  reversibility 
stated  in  its  exact  form.  Moreover,  the  method  of  derivation  avoids 
the  difficulty,  correctly  raised  by  Fowler  (/.  c.)  as  to  the  justification 
in  the  usual  derivation  of  equating  the  number  of  collisions  in  which 
molecules  go  from  state  +  ^  to  -{-  j  with  the  number  in  which  molecules 
go  from  +  y  to  +  h  since  in  the  case  of  atoms  with  more  than  two  levels, 
say  i,  j,  k  the  equilibrium  might  be  maintained  by  cyclical  processes  of  the 
type  i^>  j  —^  k  —^i. 

228.  If  now  we  desire  to  obtain  an  expression  which  makes  no  dis- 
tinction between  reverse  states,  we  may  proceed  as  was  done  in  deriving 
Corollary  a  to  the  principle  of  microscopic  reversibility,  by  noting  that 
an  expression  similar  to  equation  (382)  must  evidently  hold  for  the 
pair  of  transitions,  — i  to  — /  and  ;  to  i,  and  indeed  also  for  the  pairs 
—  i  to  j,  — y  to  i  and  i  to  — ;,  /  to  — i  if  these  transitions  occur.  Hence 
by  adding  together  these  similar  expressions  we  can  obtain  the  equation 

P.i.i^.ii^ii  +  £)  =  P.j.i^.j^^^^  (383) 

where  the  symbolism  is  evident,  P±i±j  ,  for  example,  being  the  total 
probability  of  a  collision  of  the  first  kind  in  which  an  atom  goes  from 
either  of  the  states  t  or  — i  to  either  of  the  states  ;  or  — /.     Nothing, 


226-230]   COLLISIONS  OF  FIRST  AND  SECOND  KINDS        185 

however,  will  now  be  gained  by  retaining  the  complicated  symbolism  so 
that  we  may  rewrite  equation  (383)  in  the  usual  form 

PijNi(eij  +  e)  =  PjiNjeekT  (384) 

This  expression  can  be  further  simplified  by  substituting  for  the 
ratio  Nj  over  Ni,  which  in  accordance  with  the  Maxwell-Boltzmann  dis- 

tnbution  law  has  the  value  {pj/pi)e      kT  giving  us  as  our  final  expression 

piPiiisij  +  £)  =  pjPjiE  (385) 

where  pi  and  pj  are  the  a-priori  probabilities  of  the  two  states. 

This  equation  connects  in  a  simple  manner  the  probability  Pi,  that 
impact  between  an  atom  in  state  i  and  an  electron  with  kinetic  energy 
Eij  -f  £  will  result  in  a  collision  of  the  first  kind,  with  the  probability  Pji 
that  impact  between  an  atom  in  state  /  and  an  electron  with  kinetic  energy 
6  will  result  in  a  collision  of  the  second  kind.  The  equation  has  some- 
times been  called  a  necessary  consequence  of  thermodynamic  reasoning. 
Using  the  terms  in  their  English  sense,  however,  it  is  evident  that  the 
equation  is  to  be  regarded  as  the  result  of  statistical  rather  than  thermo- 
dynamic theory. 

229.  The  equation  was  deduced  on  the  assumption  of  atoms  which 
could  be  regarded  as  stationary  compared  with  the  electrons.  It  can 
easily  be  shown,  however,  as  has  been  done  by  Fowler  (/.  c),  that  the 
same  equation  will  hold  when  we  remove  this  restriction,  providing  we 
understand  by  (sij  -{-  s)  and  £  the  mutual  kinetic  energy  of  the  atom 
and  electron  due  to  their  motion  relative  to  their  center  of  gravity.  This 
arises  from  the  fact  that  the  number  of  collisions  between  molecules  in 
which  the  mutual  kinetic  energy  lies  in  the  range  de,  has  the  same  form 
as  equations   (380)   and   (381).      (See  equation  122,  Chapter  5.) 

230.  Experimental  Example  o£  a  Collision  of  the  First  Kind. — 

At  the  present  time  the  most  satisfactory  determination  of  the  prob- 
ability of  a  collision  of  the  first  kind  has  been  carried  out  by  Dr.  George 
Glockler.*  The  work  was  done  with  helium,  using  the  transition  be- 
tween the  normal  IS  state  of  parhelium  and  the  metastable  2s  state  of 
orthohelium,  which  occurs  when  the  impinging  electrons  are  given  the 
velocity  corresponding  to  a  potential  drop  of  19.77  volts.  It  was  definitely 
shown  by  this  work,  as  had  been  suspected  before,  that  successful  col- 
lisions of  the  first  kind  can  occur  only  when  the  electrons  have  very 
close  to  the  exact  energy  (19.77  volts)  necessary  for  the  transition.  The 
electron  stream  used  in  the  experiments  came  from  a  hot  filament  and 

*  To  be  published  in  the  Physical  Review. 
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hence  there  was  the  usual  temperature  distribution  in  the  energies  of  the 
electrons.  Treating  the  electrons,  however,  as  divided  into  energy  groups 
each  of  0.1  volt  range,  it  was  possible  to  show  that  all  the  successful 
collisions  of  the  first  kind  occurred  with  electrons  having  energies  in 
the  lowest  possible  range  from  19.77  to  19.87  volts,  electrons  of  greater 
energy  causing  within  the  limits  of  error  no  transitions  to  the  higher 
state  at  all.  It  was  also  possible  to  show  that  even  for  the  electrons 
in  this  range  approximately  98.9  per  cent  of  the  actual  collisions  are 
elastic,  only  1.1  per  cent  leading  to  transition  to  the  higher  state.  This 
may  indicate  that  only  those  electrons  which  have  even  much  closer  values 
than  0.1  volt  to  the  right  energy  can  produce  the  transition. 

231.  By  making  use  of  the  relation  which  we  have  derived  between 
the  probabilities  for  collisions  of  the  first  and  second  kind,  it  is  also 
possible  to  obtain  from  the  above  facts,  some  information  as  to  the  prob- 
ability of  collisions  of  the  second  kind.  Referring  to  equation  (385),  it 
is  evident,  since  collisions  of  the  first  kind  only  occur  within  a  range 
of  energies  0.1  volt  greater  than  the  resonance  energy,  that  collisions 
of  the  second  kind  will  only  occur  with  very  slow  electrons,  in  the  range 
between  zero  and  0.1  volt.  This  conclusion,  however,  does  not  seem 
entirely  surprising.  An  electron  with  the  energy  corresponding  to  0.1 
volt  is  moving  with  a  velocity  of  1.9  X  10^  centimeters  per  second  and 
hence  would  remain  in  the  neighborhood  of  the  excited  atom  only  for  a 
very  short  time.  If  we  assume  that  the  electron  in  the  excited  orbit  is  mak- 
ing say  10'  revolutions  per  second,  the  impinging  electron  would  move  1.9 
centimeters  even  in  the  short  time  taken  by  one  revolution,  and  hence 
if  very  close  or  prolonged  interaction  between  the  two  electrons  should 
be  necessary  for  deactivation,  we  see  that  there  would  be  Httle  chance  for 
such  action  even  when  the  impinging  electron  has  as  low  a  velocity  as 
that  due  to  a  potential  drop  of  0.1  volt.  This  explanation  is  of  course 
by  no  means  certain  or  convincing,  but  has  some  interest. 

232.  We  may  also  draw  a  conclusion  as  to  the  functional  relation 
between  the  probability  of  a  collision  of  the  second  kind  and  the  energy 
of  the  impinging  electron.     Solving  equation    (385)    for  Pij  we  obtain 

F,,  =  ^  -4-  Pji  (386) 


which  within  the  range  of  e  of  0.1  volt  can  be  written  in  the  form 

Pij  =  ^-^  Pji  (387) 

Pi    ^ij 

The   experimental    facts,    however,    show   us    that   the   probability   of    a 
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collision  of  the  first  kind  Pa  reaches  its  maximum  value  when  e  is 
less  than  0.1  volt,  so  that  we  must  conclude  that  within  this  range  the 
probability  of  a  collision  of  the  second  kind  Pji  must  decrease  more 
rapidly  than  inversely  as  the  first  power  of  the  energy  e.  This  con- 
clusion which  seems  inevitable  is  surprising,  since  it  might  have  seemed 
natural  to  expect  that  the  probability  would  decrease  inversely  as  the 
velocity  or  one  half  power  of  the  energy.  Until  we  are  able  to  get 
evidence  as  to  the  exact  form  of  the  functional  relation  between  Pa 
and  E,  further  conclusions  can  hardly  be  drawn. 

233.  Probability  of  Ionization  by  Electron  Impact. — It  is  also 
possible  to  use  similar  methods  to  the  foregoing  to  obtain  information 
as  to  the  relation  between  the  probabilities  of  ionization  and  recapture 
by  electron  impact,  another  case  of  collisions  of  the  first  and  second 
kinds  first  treated  by  Fowler  (/.  c).  Consider  a  system  of  unit  volume 
under  equilibrium  conditions  containing  A^'e  electrons  and  AU  atoms  in  a 
quantum  state  i,  which  are  capable  of  being  ionized  by  electron  col- 
lision. If  Sx  is  the  energy  necessary  for  ionization,  we  may  think  of  the 
ionization  process  as  commencing  with  an  atom  in  the  state  i  and  an 
electron  having  the  energy  e^  +  ^i  and  ending  with  the  atom  in  the 
ionized  state,  the  colliding  electron  having  dropped  in  energ}'  content  from 
E^  -}-  El  to  some  new  value  £2,  and  the  dislodged  electron  going  ofif  with 
the  energy  £3,  where 

E,  =  E,^E,  (388) 

(We  thus  rule  out  the  conceivable  case  that  part  of  the  energy  goes  off 
in  the  form  of  radiation.) 

For  the  number  of  electrons  in  the  range  e.^^  -f-  £1  to  £^  +  £1  +  dst 
we  may  write 

dNe  =  Ncae~  '~^hpr^  V£^-f  £1  dE^  ( 389 ) 

and  since  their  velocity  is  y/2{E~\^~Ei)J7n  we  may  write  for  the  num- 
ber of  collisions  per  second  of  the  nature  described  above,  the  expression 

ego  +  gi     pZ 

Z^u-^^=P^u,-^jEoN^iNcac       kT    \—  (e^  +  Ei)dEi  (390) 

J  m 

where  the  probability  factor  P+i+co  rnay  be  thought  of  as  the  product  of 
the  cross  section  of  the  atom  multiplied  by  the  chance  that  a  collision 
will  be  successful  and  of  such  a  nature  that  the  impinging  electron  drops 
in  energy  from  the  range  (/£i  to  the  range  dso.  The  quantity  P+t+cc  is  evi- 
dently a  function  both  of  the  energy  of  the  impinging  electron  (fi^o  +  Ci) 
before  it  collides  and  its  energy  E2  after  the  completion  of  the  collision. 
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234.  Probability  of  Electron  Recapture  by  Triple   Collision. — 

Let  us  now  consider  the  reverse  of  the  foregoing  process,  which  will 
lead  to  the  recapture  of  an  electron  by  an  ionized  atom.  Three  bodies 
will  evidently  be  involved  in  such  a  recapture,  an  ionized  atom  in  the 
state —  CO  which  is  the  reverse  of  that  produced  by  the  previous  process, 
an  electron  with  energy  £3  =  Si  —  £0  which  will  be  recaptured,  and  an  elec- 
tron with  energy  £2  which  will  assist  in  the  recapture  by  carrying  oflf 
the  excess  energy.  The  two  electrons  in  this  reverse  process  will  of 
course  be  moving  at  the  start  towards  the  atom  instead  of  away  from 
it,  as  at  the  end  of  the  ionizing  collision. 

Since  the  chance  that  an  ion  and  the  electron  to  be  bound  are  located 
together  within  any  chosen  small  volume  is  proportional  to  the  product 
of  the  numbers  of  each  present,  and  the  chance  that  such  a  pair  will  be 
hit  by  a  second  electron  is  proportional  to  the  number  of  such  electrons 
and  their  velocity,  we  may  write  for  the  number  of  reverse  processes 
the  expression 

Z.^_i  rr  P-^.i  N.^Neac     krV Bsde^N cac     k-r  \—  Sndsn       (391) 

J  m 

where  the  proportionality  factor  P-^-i  may  be  regarded  as  the  product 
of  the  volume  of  the  sphere  of  influence  within  which  the  ion  and  elec- 
tron to  be  captured  must  be  located  multiplied  by  the  cross  section  of 
this  sphere  and  by  the  chance  that  a  collision  of  the  impinging  electron 
with  this  cross  section  will  lead  to  recapture. 

235.  Relation  betw^een  the  Two  Probabilities. — The  numbers  of 
the  two  kinds  of  processes  occurring  per  second  as  given  by  equations 
(390)  and  (391)  must,  however,  be  equal  in  accordance  with  the  prin- 
ciple of  microscopic  reversibility,  so  that  by  cancelling  equal  factors  and 
introducing  equation  (388),  we  can  obtain 

/'.i.«(eoo+ei)  =P-o..ie,V'^,^^^f^aeTr  (392) 

connecting  the  probabilities  of  an  ionizing  collision  and  its  reverse 
recapture. 

Since  a  similar  equation  will  evidently  hold  for  the  pair  of  transi- 
tions —  i  to —  00 ,  00  to  i,  and  indeed  also  for  the  pairs  —  i  to  00  ,  —  00 
to  i,  and  i  to  —  00,  co  to  — J,  if  these  transitions  exist,  we  can  obtain 
by  combination  an  expression 

Pi^{s^-^ei)=Pooie2VT,-^^^aeJf  (393) 

which  neglects  the  distinction  between  reverse  states. 
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This  equation  can  be  still  further  simplified.  In  accordance  with 
the  Maxwell  distribution  law,  as  given  by  equation  (94)  Chapter  5,  we 
have  for  a  the  value 

a=-^ (394) 

while  the  equilibrium  constant  N^  Ng/Ni  for  the  dissociation  of  an  atom 
into  ion  and  electron  has  in  accordance  with  equation  (318)  Chapter  14 
the  value 


e  CO 


N^Ne       i2nmkT)%_o  ^-  , 

so  that  by  substituting  into  equation   (393)    we  obtain 

Pi.  (fix  +  £i)  =  C  P^i  e,  VT,  (396) 

where  the  constant  factor  C  has  the  value 

^^4V2W^  (397) 

The  above  equation  connects  in  a  simple  manner  the  probability  fac- 
tors for  ionization  by  collision  with  a  fast  electron  and  recapture  by  the 
intervention  of  a  slow  electron.  The  factor  Pice  is  evidently  a  func- 
tion of  the  energy  of  the  impinging  electron  (soo  -|-  Ci)  and  also  a  function 
of  the  energy  of  this  electron  So  after  the  impact,  while  Pco  i  is  a  func- 
tion of  £2  the  energy  of  the  electron  which  assists  in  the  recapture  and  of 
£3  z=  e^  —  £2  the  energy  of  the  electron  which  is  to  be  bound. 

236.  Experimental  Values  o£  Ionization  Efficiency. — As  in  the 
case  of  the  similar  equation  connecting  the  probabilities  of  collisions  of 
the  first  and  second  kinds  between  quantized  states,  we  may  try  to  use 
equation  (396)  to  assist  in  the  interpretation  of  the  experimentally  deter- 
mined values  for  the  chance  of  ionization  by  collision  between  an  atom 
and  a  fast  electron. 

Solving  equation  (396)  for  Pi^  and  substituting  the  value  for  £3  given 
by  (388)  we  obtain 

C 


Pir  =— ^  P^i  So.  V£r=^  (398) 

£,    +  £1 

Now  Piyj  de^  gives  in  accordance  with  our  treatment  the  chance  of  ioniza- 
tion by  collision  with  an  electron  of  energy  (fico  +  ^i)  provided  this 
impinging  electron  drops  to  the  energy  interval  de^.  Hence  integrating 
over  all  possible  values  of  £2  from  zero  to  £1,  we  get  an  expression  for 


190  STATISTICAL  MECHANICS  [Ch.  17 

the  total  probability  of   ionization  by  an  impacting  electron   of   energy 
(e«  +Si),  giving  us 

CPi-,ds,  =  — ^ Cp^i  s,  Vfii  — £2  ds,  (399) 

J  Sj:.  -f-  SiJ 

0  0 

To  try  to  simplify  the  integral  on  the  right  hand  side  of  this  equa- 
tion, we  recall  the  conclusion  which  could  be  obtained  from  Glockler's 
work  that  an  impinging  electron  can  lower  the  quantum  state  of  an 
atom  only  when  it  is  moving  extremely  slowly.  Such  a  lowering  of 
quantum  state,  however,  is  somewhat  similar  to  the  lowering  from  an 
unquantized  orbit  to  a  quantized  state  that  we  are  now  considering. 
Hence  if  we  assume  that  Fcoi  will  have  appreciable  values  only  when 
the  velocity  and  hence  energy  £0  of  the  impinging  electron  is  extremely 
small,  we  see  that  for  all  except  very  small  values  of  £1  we  can  rewrite 
equation  (399)  for  the  total  probability  of  ionization  by  collision  with 
an  electron  of  energy  (£,,_,  -]-  £1)  in  the  form 

(Pi,_  de,  =    ^^^^^    fp^  i  e,deo  (400) 

•/  Er.    -(-    £1-/ 


237.  Numerous  measurements  of  the  probability  of  ionization  by 
electron  impact  have  been  made,  and  in  particular  the  corrected  values 
given  by  K.  T.  Compton  and  Van  Voorhis  ^  seem  reliable.  They  show 
in  general  for  a  considerable  number  of  substances  that  the  probability 
of  ionization  rises  from  zero  when  £1  ==  0  to  a  maximum  when  £1  be- 
comes several  times  as  large  as  £»•  This  indicates  that  the  value  of 
the  integral  on  the  right-hand  side  of  equation  (400)  must  increase 
with  £1,  since  the  factor  before  the  integral  has  its  maximum  at  Si  =  e^. 
The  quantity  behind  the  integral  is  a  function  of  the  energy  of  the 
impinging  electron  So  and  of  the  energy  of  the  electron  to  be  formed 
£3  ^  £1  —  £2.  Since  we  have  reason  to  believe  that  Fr^i  falls  ofif  very 
rapidly  with  £0,  it  may  be  that  the  increase  of  the  integral  with  £1,  indicates 
that  electrons  with  high  energies  £3  ^  £1  —  £2  are  the  most  easily  bound, 
provided  they  are  within  the  assumed  sphere  of  influence.  The  com- 
plicated form  of  the  relation  makes  it  difficult,  however,  to  draw  very 
satisfactory  conclusions.® 

°K.  T.  Compton  and  Van  Voorhis,  Phys.  Rev.,  27,  724  (1926).  The  attempted 
application  of  equation  (400)  assumes  of  course  that  no  radiation  is  given  ofif  in  the 
actual  process  of  ionization  liy  electron  impact. 

*  Since  the  above  was  written,  new  and  apparently  very  precise  measurements  of 
the  efficiency  of  ionization  have  appeared  by  E.  O.  Lawrence,  Phys.  Rev.,  28,  947 
(1926).  If  the  step-like  rise  in  efficiency  found  by  him  proves  to  be  correct,  the 
interpretation  with  the  help  of  equation  399  will  have  to  be  altered. 
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238.  To  conclude  the  chapter,  we  desire  to  emphasize  the  great  im- 
portance of  the  quaHtative  conclusion  that  the  existence  of  collisions 
of  the  first  kind  necessitates  the  existence  of  those  of  the  second  kind 
and  vice  versa.  As  yet  the  quantitative  forms  of  the  relations  between 
the  probabilities  for  the  two  kinds  of  collisions  have  not  been  greatly 
used,  but  this  will  undoubtedly  come  in  the  future,  and  in  the  mean- 
time all  our  thinking  in  the  field  of  chemical  gas  reactions  has  been 
greatly  modified  by  the  new  possibilities. 


Chapter  i8 

Introduction  to  the  Velocity  of  Physical  Chemical 
Processes.     Boltzmann's  H-Theorem 

239.    The  Approach  Towards  the  Most  Probable  Distribution. — 

In  our  introductory  discussion  of  statistical  mechanics,  special  atten- 
tion was  called  to  the  fact  that  this  science  is  superior  to  thermodynamics 
in  leading  to  theoretical  predictions  as  to  the  rate  at  which  physical- 
chemical  processes  proceed,  a  question  almost  untouched  by  thermo- 
dynamics, and  we  shall  now  devote  a  considerable  portion  of  the  remainder 
of  the  book  to  a  discussion  of  the  velocity  of  such  changes. 

The  most  general  prediction  furnished  by  statistical  mechanics  as  to 
the  nature  of  the  physical-chemical  changes  which  will  take  place  in  a 
system  which  has  not  already  come  to  equilibrium,  is  that  the  direction 
of  the  change  will  be  towards  a  more  probable  distribution  of  the  mole- 
cules. Thus  for  example  in  the  case  of  a  gas  in  which  the  molecules 
do  not  have  the  positions,  velocities  and  internal  conditions  correspond- 
ing to  the  Maxwell-Boltzmann  distribution  law,  we  may  expect  in  gen- 
eral a  readjustment  of  the  configuration  in  such  a  way  as  to  conform 
to  the  Maxwell-Boltzmann  distribution. 

The  justification  for  this  prediction  may  be  seen  by  considering  the 
motion  of  the  representative  point  for  the  gas  in  the  generalized  phase 
space  (7-space)  which  may  be  used  for  plotting  the  values  of  the  co- 
ordinates and  momenta  which  determine  the  state  of  the  gas.  Nearly 
the  whole  of  the  volume  of  this  phase  space  corresponds  to  configura- 
tions in  which  the  molecules  have  approximately  the  most  probable  con- 
figuration, and  a  very  rapidly  decreasing  fraction  of  the  space  to  the 
configurations  that  differ  increasingly  from  this  most  probable  or  Max- 
well-Boltzmann condition.  Furthermore,  in  accordance  with  Liouville's 
theorem  the  representative  point  has  no  tendency  to  linger  in  any  par- 
ticular region  of  the  phase  space,  and  in  accordance  with  the  ergodic 
hypothesis  may  be  regarded  as  travelling  on  a  continuous  path  which 
visits  substantially  all  regions  of  the  phase  space  in  succession  before 
returning  to  its  starting  point.  Hence  it  is  evident  that  when  the  rep- 
resentative point  spontaneously  leaves  the  regions  of  the  phase  space 
which   correspond   to   the   Maxwell-Boltzmann   configuration   it   will   re- 

192 
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main  outside  only  for  a  time  which  is  short  compared  with  that  in 
which  the  AJaxwell-Bohzmann  law  is  obeyed,  and  furthermore  that  a 
system  arbitrarily  started  in  a  state  where  it  is  not  in  equilibrium 
when  left  to  itself  will  in  general  move  in  the  direction  of  configurations 
in  which  the  Maxwell-Boltzmann  distribution  prevails. 

The  above  conclusion  is  analogous  to  the  thermodynamic  conclusion 
that  systems  tend  to  readjust  themselves  in  the  direction  of  increasing 
entropy,  and  it  is  apparent  that  the  statistical  mechanical  state  of  maximum 
probability  corresponds  to  the  thermodynamic  state  of  maximum  entropy. 
It  should  be  noted,  however,  that  the  statistical  mechanical  state  of 
maximum  probability  specifies  the  values  of  microscopic  variables  be- 
longing to  the  individual  molecules  of  the  system,  while  the  state  of 
maximum  entropy  merely  specifies  the  values  of  macroscopic  variables 
belonging  to  the  system  as  a  whole,  so  that  even  on  the  basis  of  these 
partial  considerations  we  obtain  from  statistical  mechanics  a  more  com- 
plete account  of  the  result  of  a  physical-chemical  change  than  we  do  from 
thermodynamics. 

Nevertheless,  neither  the  thermodynamic  conclusion  that  systems  tend 
to  change  in  the  direction  of  increasing  entropy,  nor  the  statistical  me- 
chanical conclusion  that  they  move  in  the  direction  of  the  Maxwell- 
Boltzmann  configuration,  provide  information  as  to  the  velocity  or  real 
nature  of  the  process  by  which  physical-chemical  changes  take  place, 
and  we  must  now  proceed  to  our  main  business  of  studying  the  actual 
rate  and  internal  mechanism  by  which  systems  do  approach  the  statistical 
mechanical  state  of  maximum  probability 

Our  first  endeavor  in  this  direction  will  be  a  presentation  of  Boltz- 
mann's  famous  //-theorem,  which  gives  us  an  expression  for  the  rate  at 
which  a  gas  approaches  the  Maxwell-Boltzmann  distribution  owing  to 
the  result  of  the  collisions  between  its  molecules.  The  discussion  will 
not,  as  a  matter  of  fact,  lead  to  numerical  results,  since  the  expression 
for  the  rate  of  approach  contains  coefficients  whose  experimental  value 
is  not  known.  Nevertheless,  the  investigation  is  one  of  such  funda- 
mental theoretical  interest  as  to  form  a  natural  approach  to  our  fur- 
ther discussions  of  rates  of   physical  chemical  change. 

240.  Definition  of  the  Function  H. — Consider  a  system  of  mole- 
cules, containing  a  large  number  of  molecules  of  each  of  the  kinds  present. 
Let  us  consider  the  molecules  of  each  kind  as  uniformly  distributed 
throughout  the  space  available,  and  let  us  specify  the  possible  states  of 
these  molecules,  as  in  Section  198,  Chapter  15,  by  assigning  to  each  state 
a  small  finite  region  in  the  phase  space,  which  gives  the  range 

doj  =  dqi  .  .  .  dqn  dpi  ....  dpn  (401) 
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within  which  the  internal  coordinates  q^  . .  .  .  qn  and  external  and  internal 
momenta  of  the  molecule  pi  ...  .  pn  shall  fall.  Let  A^i,  Nn,  N3  .  . .  .  be 
the  numbers  of  molecules  in  the  different  possible  states,  assumed  by  the 
different  kinds,  and  let  us  define  a  function  H  depending  on  the  way 
the  molecules  are  distributed  among  the  different  states  by  the  equation 

H  =  Ni  log  Ni  +  A^o  log  N2  +  A^3  log  A^3  +  . . .  (402) 

241.  Our  problem  now  will  be  to  calculate  the  probable  rate  with 
which  H  will  change  with  the  time,  if  the  distribution  of  molecules 
in  the  different  states  has  not  already  reached  the  steady  condition  de- 
termined by  the  Maxwell-Boltzmann  law.  Differentiating  equation  (402) 
with  respect  to  the  time,  we  obtain 

or  since  the  total  number  of  molecules  of  each  kind  is  taken  as  constant, 
we  obtain 

242.  Derivation  of  the  H-Theorem. — Let  us  now  investigate  the 
effect  of  collisions  in  producing  changes  in  the  distribution  iVi,  N^,  Ah  ■  ■  ■  ■ 
and  hence  changes  in  the  value  of  H.    Consider  some  particular  collision. 


C::) 


in  which  a  molecule  in  state  a  collides  with  one   in  state  b  to 


produce  molecules  in  states  c  and  d,  and  let  us  put  the  probability  that 
such  a  collision  will  occur  proportional  to  the  number  of  molecules  of 
the  two  kinds  present,  so  that  we  can  write  for  the  probable  rate  of  de- 
crease of  molecules  in  state  a  due  to  this  particular  collision,  an  expression 
of  the  form 

dNa  ^b.  a 


^^    =C:/,NaN,  (405) 

where  C  |!'  ^  is  the  probability  coefficient  for  the  collision  and  the  sign 
of  partial  differentiation  has  been  introduced  to  remind  us  that  the 
number  of  molecules  in  state  a  can  be  changed  by  other  kinds  of  collisions 
besides  this  particular  one. 

Since  the  above  collision  destroys   molecules  in   states   a  and   b   and 
produces  them  in  states  c  and  d,  we  see  by  referring  to  equation   (404) 
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that   the   occurrence   of    such   a   colHsion    will    evidently   contribute    four 

dH                                                                      (/// 
terms  to  the  value  of    -r—  and  v^e  may  w^rite   for  the  part   of  due 

to  this  collision  the  expression 

(^)„  ^  =  Cl'  ;  NaN,{\ogNo  +  log  A^,- log  A^,  — log  WO 


,d-^\  MM^  I  (406) 


Proceeding  next  to  the  corresponding  collision  (  '  I  we  shall  obtain 
the  contribution 

[-dr)d.c  =  ^^:f''^[NwJ  (407) 

e,   f 

and  by  adding  together  the  contributions  made  by  all  possible  collisions 
we  shall  obtain  a  total  value  for  the  rate  of  change  of  H  produced  by 
collisions. 

In   Chapter    15,    however,    we   have   shown   in    Section    196    that    the 

corresponding  collisions    ly^h    (pf)'    (    '  j,Y  ^^c,  form  a  closed 

(n,  m  \ 
j   such  that  the  molecules 

rearrive  in  their  initial  states  a  and  b.  Furthermore  in  accordance  with 
equation  (338),  Section  198,  the  probability  coefficients  C  '''  ?,  C''"  ^,  etc., 
for  such  a  cycle  of  collisions  all  have  the  same  value.  Hence  refernno- 
again  to  equations  (406)  and  (407),  it  is  evident  that  we  can  write  as  a 
value  for  the  total  rate  at  which  H  is  changing  with  the  time,  due  to  col- 
lisions, the  expression 


dH 

dt 


where  the  summation  sign  is  introduced  to  indicate  that  we  must  include 
all  possible  such  cycles  of  corresponding  collisions. 

For  simplicity  let  us  use  the   letters   a,   /5,   y,   etc.,  to   represent  the 
products  NaNh,  NcNa,  NgNf,  etc.,  and  rewrite  equation  (408)  in  the  form 

f— (-frfiXfir ©xr 

=  2  C  log  a''-''  /5"-^  yP-y  d^"^ />-/*  (409) 
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Now  in  the  series  of  quantities  a,  /5,  y,  etc.,  there  must  be  some  quantity, 
say  y,  which  is  not  greater  than  either  of  its  two  neighbors.  Let  us  then 
again  rewrite  the  equation  in  the  form 


f  =  ^^'°^ 


id)    "^  f^"'"- ^"'^  ^^'^  ■  ■  ■  ■  l'^'"  (410) 

Since,  however,  y  <.  d  and  y  <^  jH  we  must  conckide  that 


(ir  ^ 


1  (411) 


the  value  unity  occurring  only  if  y  ^  6  or  y  ^  ^.  We  have  thus  re- 
expressed  our  original  series  by  the  product  of  a  quantity  which  cannot 
be  greater  than  unity  and  a  new  series  of  just  the  same  form  but  one 
less  term.  Hence  we  can  proceed  taking  one  term  at  a  time  out  of 
the  series  until  we  have  obtained  a  product  of  quantities  none  of  which 
can  be  greater  tlian  unity. ^  The  logarithm  of  such  a  product,  however, 
must  itself  be  negative  or  zero.    Hence  we  may  conclude  that 

f  <0  (412) 

and  //  is  a  quantity  which  tends  to  decrease  with  the  time  and  reach  a 
minimum  equilibrium  value  only  when  the  distribution  of  molecules  in 
the  different  states  is  such  that 

a  =  ^  =  y  ^  etc. 


or 


Nahh  =  NeNd  =  NeNf  =  ctc.  (413) 


243.    i/-Theorem   and    Maxwell-Boltzmann    Distribution   Law. — 

It  is  immediately  evident  that  the  final  steady  distribution  prescribed 
by  equation  (413)  is  in  entire  agreement  with  the  Maxwell-Boltzmann 
distribution  law  previously  derived  for  the  steady  state.  In  accord- 
ance with  that  law  the  numbers  of  molecules  A^a  and  Nb  are  determined 

en  Ch 

by  their  proportionality  to  the  factors  e  Tf  and  c  k-T  where  £„  and  £& 
are  the  energies  of  the  molecules  in  states  a  and  b  respectively.     Hence 

^  The  reduction  of  the  series  will  finally  leave  us  with  two  terms  of  the  form 
j^-x  yx-v_  Of  the  two  quantities  there  will  be  one  which  is  not  greater  than  the 
other.     Let  us  take  x  as  not  greater  than  31,  and  combine  the  two  terms  in  the  form 


(f ) ' 


^■y-x  yx-y 

y 

Since  x  <i  y,  this  quantity  is  either  unity  or  a   fraction  raised  to  a  positive  power 
and  hence  less  than  unity. 
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fa  +  gft 

the  product   NaNi,   will   depend   on   the   proportionality    factor   e        kT 

€r    +    td 

and  the  product  NcNa  on  the  factor  c  kT  .  The  sum  of  the  energies 
Ea  +  fift  is,  however,  equal  to  the  sum  Sc  +  ^d,  since  the  total  energy  of 
the  two  molecules  cannot  be  changed  by  the  collision.  Hence  the  Maxwell- 
Boltzmann  distribution  law  also  leads  to  equation  (413). 

Moreover,  not  only  is  there  such  an  agreement  between  the  conclusions 
derived  by  the  two  methods  of  attack,  but  the  //-theorem  itself  may  be 
used  for  the  derivation  of  the  Maxwell-Boltzmann  distribution  law,  since 

a    minimum    value    for    H  ^  Ni  log  A^":  -f  A^2  log  A^2  +  ^s  log  Nz-\- 

will  evidently  give  in  accordance  with  equation  (38),  Chapter  4,  a  maxi- 
mum value  to  the  volume  of  the  phase-space  (y-weight)  in  which  the 
representative  point  for  the  system  falls.  And  this  was  the  principle 
used  to  derive  the  Maxwell-Boltzmann  law. 

244,  Critique  of  the  /f-Theorem. — Before  proceeding  we  must  call 
attention  to  the  important  fact  that  the  expression  given  by  equation 
(408)  for  the  change  of  H  with  the  time  is  only  an  expression  for  the 
probable  rate  of  change.  This  arises  from  the  circumstance  that  the 
expressions,  which  we  have  used  for  the  number  of  collisions  of  differ- 
ent kinds  that  take  place  per  second,  are  evidently  merely  expressions 
for  the  probable  number  of  collisions.  Hence  when  H  is  appreciably 
displaced  from  its  minimum  value,  there  is  a  great  probability  that  it 
will  decrease  but  not  a  certainty.  Similarly  when  H  has  reached  its 
minimum  equilibrium  value,  the  result  of  collisions  may  sometimes  lead 
to  a  temporary  increase  in  its  value. 

The  above  consideration  is  of  great  importance  in  removing  mis- 
conceptions as  to  the  behavior  of  molecular  systems.  In  accordance  with 
the  ergodic  hypothesis  the  representative  point  for  a  system  of  mole- 
cules will  in  the  course  of  time  wander  into  all  regions  of  the  phase  space 
(y-space)  permitted  by  the  energy  content  of  the  system,  the  predominant 
duration  of  the  system  in  states  according  with  the  Maxwell-Boltzmann  dis- 
tribution law  being  assured  merely  by  the  relatively  great  fraction  of 
the  total  phase  space  which  corresponds  to  such  a  distribution.  To 
account  for  the  occasional  intervals  when  the  system  does  forsake  the 
Maxwell-Boltzmann  distribution,  we  must  evidently  permit  the  occur- 
rence of  corresponding  occasional  increases  in  the  value  of  H  above  its 
minimum.  We  now  see,  however,  that  this  is  permissible,  since  the  ex- 
pression we  have  obtained  for  the  decrease  of  H  with  time  is  merely  an 
expression  for  the  probable  rate  of  decrease,  and  does  not  rule  out  occa- 
sional increases. 

A  difficulty  sometimes  arises  in  understanding  the  compatibility  of 
the  occasional  increases  permitted  to  H  with  the  great  probability  that 
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H  when  displaced  from  its  minimum  value  will  decrease.  Thus  for 
example  if  Ho  is  the  minimum  value  and  H-i,  Hn  and  H3  appreciably 
higher  values,  and  we  permit  an  occasional  increase  in  H  say  from  Hi 
up  through  Ho  to  H3  and  its  later  return  back  through  Ho  to  Hi,  it 
may  at  first  sight  be  difficult  to  see  why  the  tendency  of  the  system 
to  pass  from  Ho  to  H^  on  the  upward  course  is  not  just  as  great  as  to 
pass  from  Ho  to  Hi  on  the  downward.  The  clarification  of  this  diffi- 
culty is  particularly  due  to  P.  and  T.  Ehrenfest,-  who  have  pointed  out 
that  we  must  regard  the  change  oi  H  with  the  time  as  a  steplike  process 
of  such  a  nature  that  the  attainment  of  any  value,  appreciably  different 
from  Ho,  such  as  Ho,  is  far  more  often  followed  immediately  by  return 
to  a  lower  value  than  by  passage  on  upward  to  a  higher  state.  Thus 
the  change 

Ho  (414) 

Hi  Hi 

will  occur  far  more  frequently  than 

Hs  H3  H2  H2 

H2  or       Ho  or      Hi  (415) 

Hi  Hi 

Hence  in  the  great  majority  of  cases,  when  the  value  Ho  is  attained,  it 
forms  a  sharp  peak  of  the  trajectory  and  is  immediately  followed  by 
return  to  a  lower  value.  Hence  taking  any  given  system  with  the  value 
Ho,  there  will  be  a  great  probability  that  H  will  at  once  move  in  the 
direction  of  a  lower  value. 

These  considerations  also  eliminate  the  difficulty  in  understanding 
what  the  behavior  of  a  system  would  be  if  at  any  instant  all  the  velocities 
should  be  reversed.  If  the  tendency  of  H  is  to  decrease  in  the  forward 
motion,  it  has  sometimes  been  felt  that  the  tendency  in  the  equally  rep- 
resentative reverse  system  would  be  for  H  to  increase,  and  the  system 
to  change  into  more  and  more  improbable  configurations.  This  mis- 
apprehension, however,  is  also  removed  by  the  Ehrenfestian  considera- 
tions, since  in  the  reverse  motion  as  well  as  in  the  forward  motion  high 
values  of  H  occur  in  the  great  majority  of  cases  as  peaks  on  the 
trajectory,  so  that  also  in  the  reverse  motion  there  is  a  great  tendency 
for  a  given  system  with  a  high  value  of  H  to  move  in  the  direction  of 
lower  values. 

245.  Extension  of  the  //-Theorem  to  Include  Radiation. — Boltz- 
mann's  //-theorem,  as  deduced  above,  concerns  itself  with  only  one  of 
several  processes  that  may  be  involved  in  changing  a  system  of  mole- 

^  P.  and  T.  Ehrenfest,  Enc\klop'ddic  dcr  mathcmatisohcn  IVisscnschaftcn,  IV, 
32,  page  42,  Leipzig,  Teubner,  1907-1914. 
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cules,  which  has  not  ah^eady  come  to  equilibrium,  into  a  more  probable 
state.  In  the  first  place,  since  it  considers  a  system  in  which  the  differ- 
ent kinds  of  molecules  present  are  already  uniformly  distributed  through- 
out the  space  available,  the  total  process  involved  in  reaching  equi- 
librium must  include  diffusion,  a  matter  which  will  be  discussed  in  the 
next  chapter.  In  addition,  the  deduction  considers  only  the  effect  of 
molecular  collisions  in  producing  equilibrium,  and,  although  the  method 
of  proof  seems  applicable  without  modification  to  the  newly  discovered 
kinds  of  collisions  which  can  take  place  between  quantized  molecules,  it 
was  not  possible  in  Boltzmann's  time  to  give  any  satisfactory  treatment 
to  processes  involving  the  interaction  of  molecules  with  radiation.  In 
what  follows  we  shall  present  an  extension  of  the  //-theorem  in  a  form 
which  includes  the  emission  and  absorption  of  radiation  by  quantized 
molecules,  as  one  of  the  processes  by  which  equilibrium  is  approached. 
We  shall  not  try  to  consider  more  complicated  cases,  like  the  Compton 
effect  in  which  the  incident  radiation  is  only  partly  absorbed  by  the  elec- 
tron, nor  cases  of  combined  collisional  and  radiational  action.  The  method 
of  treatment  to  be  given  may  seem  a  little  arbitrary  and  artificial,  but 
will  be  of  interest. 

Let  us  consider  a  system  of  unit  volume  containing  N^,  No,  N3,  etc., 
molecules  in  different  quantum  states  and  )h2,  ^hs,  «23,  etc.,  light  quanta 
of  the  frequency  necessary  for  raising  the  molecules  by  absorption  from 
one  quantum  state  to  another  as  indicated  by  the  subscripts.  Further- 
more, let  us  indicate  by  nhn,  «ii3,  «^23,  etc.,  the  number  of  modes  of  electro- 
magnetic vibration  which  on  the  basis  of  the  classical  theory  would  be 
the  carriers  for  radiation  of  the  frequency  indicated.  We  shall  now  define 
H  for  the  combined  system  of  molecules  and  quanta  by  the  equation 

H  =  N,  log  A^i  +  A^o  log  A^o  -f  A^g  log  AA3  +  .  . . 
-f  «i2  log  ;?i2  +  ni3  log  ^13  +  W23  log  «.3  +  .  .  . 
—  ("i2  +  "^12)  log  (ni2  -f  71112)  — (mi3  +  W13)  log  (;/i3  +  "hs) 

—  (W23   -f-   llhs)     log     («23   +   "^23)    ... 

=  1iNi\ogNi-\-  l^nijlogtiij  —  '2(nij -^  niij)  log  (nij -\- lUij)    (416) 

»■  1;  ij 

This  definition  of  H  may  seem  somewhat  arbitrary.  The  terms  depend- 
ing on  the  numbers  of  molecules  in  different  states  is,  however,  the  same 
as  that  in  Boltzmann's  definition  of  H,  and  the  choice  of  the  new  terms 
depending  on  the  number  of  quanta  of  different  kinds  present  will  be 
justified  by  the  results. 

We  must  now  calculate  the  change  of  //  with  the  time.  We  obtain 
by  differentiation 
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or  since  the  second  term  on  the  right  hand  side  is  evidently  zero  we 
obtain  by  cancellation 

^^S^logiV.  +  S^log        7^  (417) 

dt         i     dt      ^         ^    ij     dt       ^  ni)  +  ma  ^       ^ 

It  is  at  once  evident  that  molecular  collisions  will  lead  to  a  decrease 
of  H  with  the  time,  unless  the  minimum  value  has  already  been  reached, 

since  the  terms  of  the  form  -7—   log  N i  are  the  same  as  in  the  original 

derivation  of  the  //-theorem  and  the  remaining  terms  are  not  afifected 

by  collision.     Hence  we  have  only  to  consider  the  effect  of  the  absorp- 

,  dH 
tion  and  emission  of  radiation  on  the  value  of  —7-. 

at 

Consider  the  absorption  process  by  which  a  molecule  goes  from  state  ♦ 
to  i  and  the  associated  emission  by  which  it  goes  from  /  to  i.  Using 
for  the  rates  of  these  two  processes  Einstein's  expressions  as  given  in 
Sections  208  and  210  in  Chapter  16,  we  evidently  obtain  as  the  net  re- 
sult of  these  two  processes 

dN  i  dN  i  dtlii  T,r/7->  ,  ^\  Ar7^  /^10\ 

-^= dt^~W^  Nj{BjiUij  +  Aji)  —NiBijUi}       (418) 

where   Ua   is  the   radiation  density  of   the   frequency   involved,   and  the 

reason    is    evident    for    setting    the    net    rate  —r—       at    which    molecules 

dt 

dfiij 
drop  from  /  to  i  equal  to  the  rate  of  production     ,       of  the  correspond- 
ing quanta. 

Comparing  equations  (417)  and  (418),  it  is  evident  that  the  absorp- 
tion and  emission  process  considered  will  contribute  three  terms  to  the 
rate  of  change  of  H  with  the  time,  which  can  be  combined  in  the  form 

^H        TAT/o  ,     ^    N        AT  n         11  ^i'^hj  (419) 

-^  =  [y,(ft,».,  +  Aj,)  -  iV.B.,»„]  log  ^^^^^^  _^  ^^^^^ 

The  quantity  nij/^nij  -\-  mij)  can,  however,  easily  be  put  into  a  better 
form  for  our  purposes,  na  is  the  number  of  quanta  present,  of  the 
correct  frequency  Vij  to  produce  the  change  from  i  to  /,  and  ma  is  the 
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number   of   modes   of   electromagnetic   vibration   which  on   the   classical 
theory  carry  energy  of  this  frequency,  so  that  we  can  write 


, — —  ov 
tiij        hvij  Uij 


Hij  4-  mij         Uij    „      ,    Snvij^  .  ,    Snhvif 

dv-] -r—   dv  Uij  -\ -r— 


(420) 


llVu 


or  substituting  the  relations  between  Ba,  Bji  and  A  a  given  by  equation' 
(357)  in  Chapter  16,  we  obtain 


Uij  -\-  niij       BjiUij  -)-  Aji 

Substituting  in    (419)    and   rearranging,   we   obtain  as   that   part   of 

dH         .       . 

—J—  which  is   due   to  the  processes   of   absorption   and   emission   under 

discussion 

dt    -^""^[NjiBjiUij  +  Aji))  ^^     ^ 

This  expression,  however,  is  of  the  form  log  (x/y)"'^,  and  this  is  neces- 
sarily negative  unless  x  ^  y,  when  it  becomes  zero.  (See  footnote,  page 
196.)  Hence,  since  all  the  possible  processes  of  absorption  and  emission 
can  be  combined  in  similar  pairs,  it  is  evident  that  the  interaction  of 
matter  and  radiation,  as  well  as  the  action  of  collisions,  will  lead  to  a  prob- 
able decrease  of  H  with  the  time  until  the  minimum  value  is  obtained, 
and  when  this  occurs  we  shall  have  an  equality  between  the  rates  of 
emission  and  absorption  of  the  quanta  of  any  frequency  capable  of  such 
interaction.  We  have  thus  obtained  the  desired  extension  to  Boltzmann's 
//-theorem. 

246.  Combined  Deduction  of  Maxwell-Boltzmann  and  Planck 
Distribution  Laws. — It  is  also  of  interest  to  verify  the  fact  that  the 
attainment  of  the  minimum  value  of  H  will  lead  to  a  condition  of  the 
system  in  which  we  have  both  the  Maxwell-Boltzmann  law  for  the  dis- 
tribution of  the  molecules  in  different  states  and  the  Planck  law  for  the 
distribution  of  the  radiation. 

In  accordance  with  our  equation  of  definition  (416),  we  have  for  H 
the  value 

H  =  ^Ni  log  Ni  +  2«iy  log  nij  —  ^(na  -f  mij)  log  (juj  -\-  niij)    (423) 

'The  a-priori  probabilities  pi  and  pj  must  be  taken  as  equal,  in  agreement  with 
the  convention  we  have  adopted  in  Chapter  15  for  assigning  equal  regions  of  phase 
space  to  different  states  of  a  molecule  when  coUisions  are  to  be  considered. 
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and  if  for  simplicity  we  consider  only  one  kind  of  molecules  *  we  have  for 
the  total  number 

A^  =  2A^i  (424) 

i 

while  for  the  total  energy  of  the  system  E  we  evidently  have  the  value 

£  =  2A^£;  +  ^nijhvii  (425) 

i  ij 

To  obtain  the  minimum  value  of  H  we  may  set  its  variation  equal 
to  zero.  The  variation  can  not,  however,  be  carried  out  entirely  arbitrarily, 
since  the  total  energy  and  total  number  of  molecules  must  be  preserved 
constant.  Hence  setting  the  variations  for  all  three  expressions  (423), 
(424),  and  (425)  equal  to  zero  and  combining  by  the  familiar  method 
of  undetermined  multipliers,  we  may  obtain 

0  --=  XdNi  log  Ni  +  ^diiij  log 


+  ^y^Ni  (426) 

i 

+  lii'EeidNi  -j-  /uliliVijduij 
»  i; 

where  2.  and  //  are  undetermined  multipliers.  This  equation  can  be  re- 
written in  the  form 

2(log  Ni  +  X  +  fiei)dNi  +  S  ("  log '-^ +  /!/»',■;  )  dnu  =  0  (427) 

and  since  the  introduction  of  the  undetermined  multipliers  has  made  the 
variations  arbitrary,  we  must  have  for  each  state  /  and  each  frequency  Vij, 
the  result 

log  Ni-^?.^  inei  =  0    ;   Ni  =  e'^  e-i'''  (428) 

log 'hi ^^,/n',-,-  =  0     ;    '^ ^^-MA..>  (429) 

Substituting,  however,  the  familiar  value  //  =  l/kT,  and  noting  that 
e~^  must  have  such  a  value  that  SA/",-  =  A^  we  easily  change  equation 
(428)  over  into  the  familiar  Maxwell-Boltzmann  form 

N.  =  ^  (430) 

^  e~kf 

i 

*  The  extension  to  more  than  one  kind  of  molecules  is  simple  and  obvious. 
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while,  substituting  for  na/ {nij  -\-  ma)  the  expression  given  by  equation 
(420)  and  transforming  we  obtain  the  famihar  form  of  the  Planck  dis- 
tribution law 

"./  =  ^l^  (431) 

e  kT  —  1 

We  thus  see  that  the  attainment  of  the  minimum  value  of  H  will  lead 
both  to  the  Maxwell-Boltzmann  and   Planck  laws. 

As  indicated  by  the  subscripts  in  equation  (431)  our  proof  con- 
cerns only  those  frequencies  which  can  be  absorbed  or  emitted  by  the 
molecules  present,  and  in  an  actual  system  containing  only  one  kind  of 
molecules,  we  must  often  expect  that  the  walls  will  play  a  considerable 
part  in  the  attainment  of  radiation  equilibrium. 

247.  Conclusion. — We  cannot  conclude  this  chapter  without  an  ex- 
pression of  admiration  for  Boltzmann's  masterful  achievement  in  dis- 
covering and  deducing  the  //-theorem,  even  though  his  deserved  and 
honored  fame  needs  from  us  no  words.  The  significance  and  originality 
of  inventing  a  function  which  should  measure  the  displacement  of  a 
system  of  molecules  from  statistical  equilibrium,  and  the  ingenuity  and 
elegance  of  the  proof  that  this  function  will  decrease  with  the  time,  alike 
compel  attention. 

The  analogy  between  increase  in  the  thermodynamic  quantity  entropy 
and  decrease  in  the  statistical  mechanical  quantity  H  is  evident.  And 
the  attempts  to  relate  the  entropy  S  to  the  negative  of  //  should  not  be 
overlooked.  The  new  statistical  mechanical  quantity,  however,  certainly 
gives  the  greater  insight  into  the  inner  mechanism  of  physical -chemical 
changes. 

The  cosmological  implications  of  his  theorem  were  also  appreciated 
by  Boltzmann.  In  whatever  part  of  the  whole  universe  or  instant  in 
time  we  may  find  ourselves,  there  is  an  enormous  probability  that  the 
value  of  H  will  be  decreasing  with  the  time  unless  it  has  already  reached 
its  minimum  value,  and  we  shall  label  forward  in  time  that  direction  in 
which  H  decreases  and  energy  is  dissipated,  and  perhaps  look  longingly 
backward  to  the  good  old  days  when  there  was  a  greater  store  of  un- 
dissipated  energy.  But  in  the  course  of  time  enormous,  however,  com- 
pared with  that  which  we  label  aeons,  the  value  of  H  may  again  increase 
and  prepare  for  a  new  cycle  of  significant  events. 

In  the  following  chapters  we  shall  discuss  more  specific  cases  of  the 
rate  at  which  equilibrium  is  approached  in  some  selected  physical-chemical 
processes. 


Chapter  19 

The  Rates  of  Some  Physical  Processes 
Transport  Problems 

248.  Introduction.' — In  the  present  chapter  we  shall  consider  the 
rates  of  some  important  physical  processes  which  are  often  encountered 
in  physical  or  chemical  experimentation.  The  processes  to  be  discussed 
are  those  which  concern  the  transport  of  matter,  energy,  momentum  or 
electricity  by  molecular  systems. 

These  processes  take  place  of  course  in  systems  which  are  not  in  a 
state  of  equilibrium,  and  hence  are  examples  of  the  general  tendency 
of  molecular  systems  to  change  in  the  direction  of  more  probable  con- 
figurations. Since  the  systems  under  consideration  are  not  in  equilibrium 
but  in  a  "steady  state"  of  change,  the  molecules  of  course  do  not  have 
their  coordinates  and  momenta  distributed  according  to  the  Maxwell- 
Boltzmann  law,  and  the  most  fundamental  method  of  treating  these  prob- 
lems is  to  investigate  the  actual  distribution  which  the  coordinates  and 
momenta  do  assume  and  thus  determine  the  behavior  of  the  system  dur- 
ing the  change  that  occurs.  Unfortunately  this  method  is  usually  one  of 
great  mathematical  complexity  and  often  involves  a  great  deal  of  in- 
formation or  hypothesis  as  to  the  dynamics  of  molecular  interaction. 
Hence  we  shall  employ  instead  the  methods  of  the  older  kinetic  theory 
which  treat  the  system  as  though  there  were  merely  a  superimposed 
disturbance  which  is  transmitted  by  molecules  having  the  Maxwell- 
Boltzmann  distribution.  The  effect  of  the  superimposed  disturbance  in 
changing  the  Maxwell-Boltzmann  distribution  is  thus  neglected  and  this 
sometimes  introduces  errors  which  can  be  shown  to  be  of  appreciable 
magnitude.  On  the  other  hand  the  uncertainty  of  the  physical  hypotheses 
that  must  be  arbitrarily  introduced,  at  our  present  state  of  knowledge, 
rarely  justifies  the  more  fundamental  but  more  difficult  methods  of  attack.^ 

Since  the  material  in  this  chapter  is  somewhat  foreign  to  the  main 
theme  of  the  book  and  is  introduced  mainly  for  convenience  of  reference, 

^As  an  important  example  of  the  use  of  more  fundamental  methods  see  S. 
Chapman,  Philosophical  Trans,  of  the  Royal  Society,  216,  279  (1916).  The 
more  fundamental  methods  employed  in  this  work  seem  justified  by  the  agreement 
that  is  now  obtained  between  values  of  molecular  diameters  calculated  from 
viscosity  and  from  van  der  Waals  b. 

204 


248-249]  TRANSPORT  PROBLEMS  205 

we  shall  in  some  cases  content  ourselves  merely  with  a  statement  of 
results,  the  deductions  for  which  can  be  found  in  the  literature.  A 
considerable  number  of  deductions  will  be  included,  however,  to  illustrate 
the  methods.     As  already  noted  the  results  are  not  always  exact. 

We  shall  first  consider  the  transport  of  matter  from  one  region  to 
another  in  both  of  which  the  molecules  may  be  regarded  as  sensibly  in 
statistical  equilibrium,  the  two  regions  being  separated  by  a  negligible 
layer  where  an  abrupt  change  in  properties  takes  place.  {Effusion, 
ezKiporation,  solution. ) 

We  shall  then  consider  the  transport  of  matter,  momentum,  and 
energy  in  a  gas  under  the  simplified  conditions  that  the  density  is  low 
enough  so  that  the  collisions  between  molecules  can  be  neglected  com- 
pared with  collisions  with  the  walls  of  the  apparatus.  {Phenomena  of 
nwlecular  Hozv.) 

Finally  we  shall  consider  the  transport  of  matter,  energy,  momentum 
and  electricity  through  regions  where  there  is  a  continuous  variation 
in  conditions  as  we  proceed  in  the  direction  of  flow,  and  the  concentra- 
tion is  so  high  that  the  rate  of  flow  is  determined  by  the  collisions  of 
the  molecules  among  themselves.  {Diffusion,  thermal  conduction,  viscos- 
ity, electrical  conduction  as  dependent  on  the  phenomenon  of  mean  free 
path.) 

249.  General  Equation  for  the  Rate  of  Transport  of  Matter 
Across  a  Boundary  in  a  Gas  at  Equilibrium. — If  we  consider  a  gas 
in  equilibrium,  with  the  molecular  velocities  distributed  in  accordance 
with  Maxwell's  law,  we  can  easily  calculate  the  number  of  molecules 
which  will  cross  any  boundary  surface  in  the  gas  by  considering  the 
average  component  of  molecular  velocity  perpendicular  to  this  surface. 
In  the  case  of  a  gas  in  equilibrium,  this  flow  will  of  course  be  com- 
pensated by  an  exactly  equal  flow  in  the  opposite  direction.  But  in  the 
case  the  surface  is  a  thin  layer  of  discontinuity  we  can  often  obtain 
information  as  to  the  net  rate  of  transfer  of  material  across  this  boundary, 
with  the  help  of  such  a  calculation. 

For  convenience  let  us  consider  a  surface  of  one  square  centimeter 
perpendicular  to  the  .ar-axis.  For  the  number  of  molecules  per  cubic 
centimeter,  with  component  velocities  in  the  range  dx  dy  dz,  we  have  in 
accordance  with  Maxwell's  law  as  given  by  equation  (90)  in  Chapter  5 

dN  =  N  (  2"^  y%  -  IFr  ^■'"  +  ^'  +  '"-^  dxdydz  ( 432 ) 

where  A^  is  the  total  number  per  cubic  centimeter. 

Components  of  velocity  in  the  y  and  s  directions  will  have  no  effect 
on  the  passage  of  molecules  across  the  boundary  in  question,  so  that  a 
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parallelopiped  of  gas  of  volume  1  X  1  X  -v  niay  be  regarded  as  supply- 
ing the  molecules  of  velocity  x  which  cross  the  boundary  per  second.'^ 

Hence  for  the  total  number  of  molecules  crossing  the  boundary  per 
square  centimeter  per  second,  we  may  evidently  write  the  integral 

+   =0       +   OC'     +    CO 

Z^    (   ^   ^  N  (j^)  ^^~  ^^"  ^  ^  "^  "^  xdxdyd^     (433) 

/D      —  X         0 

The  indicated  integrations  can,  however,  easily  be  performed  giving  us 

where  M  is  the  ordinary  molecular  weight.^ 

Comparing  equation  (434)  with  equation  (95)  in  Chapter  5  for  the 
average  velocity  Q  we  see  that  we  can  also  write  the  above  expression  in 
the  form 

Z  =  ^Q  (435) 

a  result  which  was  perhaps  obvious,  since  the  average  component  in  any 
direction  of  a  velocity  Q,  distributed  in  random  directions,  is  easily 
shown  to  be  Q/4. 

Substituting  the  gas  laws  in  the  form 

N  =  {^  (436) 

we  can  obtain 


(p  in  mm.  of  mercury) 
or  for  the  mass  of  gas  crossing  the  boundary  per  second 

Zm  =  p  ^j^  =  58.32  X  10-=*  p  ^^  gm.  cm.-=^  sec.-^    (438) 
(p  in  mm.  of  mercury) 

These  equations  will  be  found  quite  useful. 

*  Molecules  leaving  the  parallelopiped  because  of  their  i;  and  a  components  of 
velocity  are  evidently  substituted  for  by  those  entering  the  parallelopiped.  In 
addition  collisions  occurring  within  the  boundary  have  no  net  effect. 

^A  number  of  numerical  factors  given  in  this  chapter  have  been  taken  from 
the  very  useful  book  of  Dushman,  "High  Vacuum,"  Schenectady,  1922. 


249-251]  TRANSPORT  PROBLEMS  207 

250.    Rate  of  Effusion  of  Gas  through  a  Small  Opening. — If  we 

consider  now  two  samples  of  gas  at  different  pressures  separated  by  a 
thin  partition  with  a  small  opening  of  area  S,  small  enough  so  that  we 
can  regard  the  gas  on  each  side  as  under  substantially  equilibrium  condi- 
tions, we  can  calculate  the  flow  through  the  opening  as  the  net  result 
of  the  passage  of  gas  in  the  two  directions  as  given  by  equation  (437), 
and  obtain  for  the  net  number  of  molecules  passing  per  second  * 


Z  =  Z..-i.=  (p..-p.)Si\j±^  (439) 

If  we  desire  the  flow  expressed  in  terms  of  the  volume  measured  at 
unit  pressure  discharged  per  second,  we  obtain 


RT 


Q  =  ZkT  =  (^  -  P.)S^^^  =  (/..  -  p.)S  ^2,,, 
For  a  circular  opening  of  diameter  D,  this  gives  us 


(440) 


Q  =  (/'.-/'i)^y2^  =  2864(/>.-^)I^^^       (441) 

where  D  is  in  centimeters  and  the  voluume  Q  per  second  is  in  cubic 
centimeters  measured  at  temperature  T  and  unit  pressure  on  the  scale 
used  for  p2  and  pi. 

These  equations  of  course  only  hold  when  the  opening  is  so  small 
that  we  can  regard  the  gas  as  substantially  under  equilibrium  conditions 
on  the  two  sides  of  the  system.  This  will  be  true  when  the  diameter 
of  the  hole  is  small  compared  with  the  mean  free  path  of  the  molecules. 

251.  Thermal  Effusion. — The  method  of  calculation  can  be  applied 
equally  well  when  the  gas  on  the  two  sides  of  the  partition  is  at  different 
tetnperatures.  We  then  obtain  from  equation  (437)  for  the  number  of 
molecules  passing  per  second 


/i    Z^2  -^1   


y/2nmk\VT\       VtJ 


and  for  zero  net  rate  of  flow,  we  obtain  the  result 

p2  _  VfT 


(443) 


If  we  consider  an  infinitesimal  change  in  temperature  across  a  boundary 

■'This    relation    was    first    discovered    and    experimentally    verified    by    Knudsen, 
Anyu  der  Physik,  28,  999  (1909). 
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in  a  tube  connecting  two  reservoirs  at  different  temperatures,  we  obtain 
from  equation   (443) 

d  log  p  =  ^  dlogT  (444) 

and  hence  by  integration  obtain  the  same  equation 

^  =  ^  (445) 

pi       VTi 

for  the  steady  condition  of  no  net  flow,  even  when  the  temperature  drop 
occurs  gradually  in  a  tube  connecting  two  reservoirs  at  temjDeratures 
T2  and  Ti. 

This  interesting  relation  was  first  discovered  and  tested  by  Knudsen,^ 
and  shows  that  at  pressures  Ion}  enough  so  that  the  mean  free  path  is 
large  compared  u^itJi  the  dimensions  of  the  connecting  tube,  the  steady 
state  of  a  gas  in  two  connecting  reservoirs  is  not  one  of  uniform  pressure 
throughout,  but  that  the  reservoir  at  higher  temperature  will  also  be  at 
the  higher  pressure.  The  analogy  with  the  Soret  effect  in  solutions 
should  be  noted. 

252.  Rate  of  Evaporation. — Our  simple  equation  (434)  for  the 
number  of  molecules  reaching  a  given  surface  can  also  be  used  to  investi- 
gate the  rate  of  evaporation  from  a  solid  or  liquid.  If  we  consider  a 
surface  of  one  square  centimeter  of  some  material  in  equilibrium  with 
its  vapor,  the  number  of  molecules  reaching  that  surface  per  second  from 
the  gas  phase  will  be 


N 


where  po,  being  the  pressure  of  the  gas  at  equilibrium,  is  the  vapor  pres- 
sure of  the  material. 

If  r  is  the  fraction  of  these  molecules  which  are  reflected  on  striking 
this  surface,  and  (1  —  r)  the  fraction  that  condense,  then  it  is  evident 
that  to  maintain  equilibrium  the  rate  of  eva^xjration  per  square  centimeter 
must  be 


«VS  (447) 


^evap.  =    (1  r)p 

and  since  we  can  in  general  expect  the  rate  of  evaporation  to  be  inde- 
pendent of  the  pressure  of  vapor,  equation  (447)  can  be  used  directly 
for  the  rate  of  evaporation  in  to  a  vacuum. 

Experimentally  the   fraction  r  of   molecules  reflected   is  often   found 
to   be   very    small.      Thus    in   the    case   of    a   clean    surface   of    mercury 

"  Knudsen,  Ann.  der  Physik,  31,  205   (1910).     See  also  for  a  more  satisfactory 
demonstration  Smoluchowski,  Ami.  der  Physik,  33,  1559  (1910). 
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Knuclsen  "^  found  that  less  than  one  per  cent  of  the  impinging  mole- 
cules were  reflected.  This  makes  it  possible  as  a  first  approximation 
to  neglect  the  value  of  r  in  equation  (447)  in  comparison  with  unity 
and  use  the  rate  of  evaporation  into  a  vacuum  to  determine  the  vapor 
pressure  po  of  highly  involatile  substances.  This  has  been  done  by 
Langmuir  '  and  Langmuir  and  McKay  ^  for  tungsten,  platinum,  and 
molybdenum.  The  method  is  a  very  beautiful  and  simple  one  for  the 
solution  of  problems  that  could  not  otherwise  be  attacked. 

Since  the  current  from  a  hot  filament  measures  the  rate  of  evapora- 
tion of  electrons  from  it,  equation  (447)  can  also  be  used  to  obtain  a 
relation  between  the  value  of  the  saturation  current  from  a  filament  and 
the  vapor  pressure  of  the  electron  gas  which  would  be  in  equilibrium 
with  it  at  the  temperature  under  consideration.  This  has  made  it  pos- 
sible to  apply  thermodynamic  reasoning  to  the  problem  of  thermal  emis- 
sion ^  and  to  show  that  the  entropy  of  electron  gas  is  given  by  the  same 
formula  as  for  other  monatomic  gases. ^° 

For  the  net  rate  of  evaporation  into  a  vessel  containing  vapor  at 
pressure  p,  we  should  evidently  expect  from  equation  (446)  to  obtain  the 
value 


Zr...=  {^-r)^j^iP.-p) 


(448) 


253.  Rate  of  Solution. — The  process  of  solution  is  analogous  to 
that  of  evaporation.  We  have,  however,  imperfect  information  as  to  the 
velocities  of  the  solute  molecules  and  the  nature  of  the  boundary  between 
solid  and  solution.  Nevertheless,  we  are  not  surprised  to  find  that  the 
net  rate  of  solution  is  proportional  to  the  difiference  between  the  con- 
centration at  saturation  Co  and  the  instantaneous  concentration  C  in 
accordance  with  the  equation 

^-£  =  K{Co-C)  (449) 

which   is    entirely   analogous    to    equation    (448)    for    evaporation.     The 
equation  was  first  tested  experimentally  by  Noyes  and  Whitney.^^ 

254.  Transfer  of  Momentum  at  Very  Low  Pressures. — Let  us  now 

turn   our  attention  to  some  problems   concerning  the  behavior   of   gases 
at  very  low  pressures,  which  can  be  treated  quite  simply  if  the  density 

"Knudsen,  Ann.  der  Physik,  47,  697   (1915). 
'  Langmuir,  Phys.  Rev.,  2,  329  (1913). 
'Langmuir  and  McKay,  Phys.  Rev.,  4,  Z77  (1914). 

'  See  Richardson,  "The  Emission  of  Electricity  from  Hot  Bodies,"  Longmans 
Green,  London,  1921. 

"See  Tolman,  Jotir.  Amer.  Chem.  Soc.,  43,  1592  (1921). 
"Noyes  and  Whitney,    Zeits.  f.  Physik.  Cheniie,  23,  689  (1897). 


210  STATISTICAL  MECHANICS  [Ch.  19 

is  low  enough  so  that  the  collisions  between  molecules  can  be  neglected 
compared  with  collisions  with  the  walls  of  the  apparatus. 

Consider  two  parallel  plates  A  and  B  separated  by  a  distance  d  and 
immersed  in  a  gas  at  low  enough  pressure  so  that  the  mean  free  path  is 
long  compared  with  the  separation  d.  Then  if  the  molecular  velocities 
are  assumed  to  be  distributed  in  accordance  with  Maxwell's  law  with  an 

average  value  Q,   it   is   evident,   since  y    will   be   the   average   velocity 

towards  one  plate  or  the  other,  that  a  molecule  in  the  space  between 
the  plates  will  make  on  the  average  Q/Ad  "round  trips"  per  second  be- 
tween the  plates.  And  if  the  concentration  of  molecules  per  cubic  centi- 
meter is  A^,  the  total  number  of  round  trips  per  second  per  square  centi- 
meter of  plate  area  will  be 

Z  =  Nd-ri=—r-  (450) 

4d         4 

Suppose  now  that  one  of  the  plates  A  is  in  motion  in  its  own  plane 
with  the  velocity  u,  then  if  we  assume  that  each  molecule  striking  A 
itself  acquires  a  component  of  velocity  u  in  the  direction  of  motion  of  the 
plane,  but  that  otherwise  the  Maxwell  distribution  is  not  disturbed,  we 
shall  have  for  the  momentum  transferred  per  square  centimeter  per  second 

M  =  -j--mu  (451) 

and  this  of  course  is  the  force  per  square  centimeter  of  surface  necessary 
to  keep  the  plane  A  in  its  state  of  steady  motion.  Hence  substituting 
for  Q  its  value  as  given  by  equation  (95),  Chapter  5, 

Q  =  JW.  (452) 

l'    711)1 

we  have  for  the  force  per  square  centimeter  necessary  to  keep  the  plane  A 
in  motion  parallel  to  B,  with  the  velocity  u 

„       Nmu   jSkT        ^  ^j    m  //ie-5\ 

Unlike  the  condition  of  affairs  in  ordinary  viscosity  experiments  with 
gases  at  higher  pressures,  as  will  be  seen  in  Section  267,  the  force  is 
independent  of  the  distance  between  the  plates,  and  at  a  given  tempera- 
ture is  proportional  to  the  pressure  of  the  gas  instead  of  independent  of  the 
pressure. 

Of  course  if  the  molecules  on  striking  plate  A  do  not  receive  on  the 
average  the  component  of  velocity  u  the  same  as  the  velocity  of   the 
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plate,  the  amount  of  momentum  transferred  will  be  smaller  and  the  force 
itself  smaller.  As  a  result  of  experiments,  however,  Knudsen  believes 
that  the  molecules  do  acquire  on  the  average  substantially  the  velocity 
of  the  plate. 

The  principle  exemplified  by  equation  (453)  has  been  used  by 
Dushman  ^^  in  constructing  a  gauge  for  measuring  low  gas  pressures, 
in  which  momentum  is  transferred  from  a  rotating  disk  to  parallel  disk 
hung  above  it  by  a  quartz  fibre.  If  the  molecules  do  acquire  on  the 
average  the  velocity  of  the  moving  disk  the  instrument  could  be  made 
an  absolute  one,  but  in  any  case  we  should  expect  the  torque  to  be  pro- 
portional to  pVnv/kT  u.  The  instrument  has  been  used  for  measuring 
pressures  as  low  as  10'^  mm.  of  mercury. 

255.    Flow  of  Gas  Through  a  Tube  at  Very  Low  Pressures. — 

The  foregoing  method  can  be  applied  to  investigate  the  flow  of  gases 
through  tubes.  If  the  pressure  is  low  enough  so  that  we  can  neglect 
the  collisions  between  molecules  as  compared  with  collisions  with  the  walls, 
the  only  momentum  lost  by  the  gas  flowing  through  the  tube  will  be 
that  transferred  to  the  walls,  and  we  can  equate  this  loss  of  momentum 
per  second  to  the  force  exerted  on  the  moving  gas  by  the  pressure  drop 
along  the  tube.  If  we  assume  then  that  the  molecules  moving  through 
the  tube  have  the  Maxwell  distribution  of  velocities,  with  an  average 
superimposed  velocity  ii,  we  can  write  with  the  help  of  equation  (434), 
for  a  tube  of  diameter  D,  the  expression 


37D2  /  kT 

^dp  =  muN  V#^  nDdl  (454) 

where  dp  is  the  drop  in  pressure  in  the  length  d!,  inu  is  the  momentum 
transferred  per  collision,  and  A^ VkT/Znm  nDdl  is  the  number  of  collisions 
per  second  in  the  length  dl.  In  a  state  of  steady  flow,  however,  the  product 
Nu,  which  is  the  number  of  molecules  passing  any  boundary  per  square 
centimeter  per  second,  must  be  a  constant  along  the  length  of  the  tube, 
so  that  we  can  easily  integrate  over  a  total  length  L  corresponding  to  the 
pressure  drop  {p^  —  P2)  and  obtain 


-^  ipi  —  p2)  =Nu^-^  jiDL  (455) 

For  the  total  number  of  molecules  passing  any  boundary  in  the  tube  per 
second  we  evidently  have  NimD^/A  or  for  the  volume  Q  measured  at  unit 
pressure  kT  times  this  quantity,  so  that  by  substituting  in  (455)  and 
solving  for  Q  we  can  obtain 

"Dushman,  Phys.  Rev.,  5.  212  (1915). 
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The  derivation  is  evidently  not  exact  since  we  have  arbitrarily  treated 
the  molecules  as  having  the  Maxwell  distribution  of  velocities  with  a  uni- 
formly superimposed  velocity  u  along  the  tube.  Knudsen/^  by  assuming 
that  the  molecules  of  high  absolute  velocity  c  have  a  proportionally  high 
superimposed  velocity  u,  obtains  the  slightly  different  result 

and  this  same  result  is  also  obtained  by  Smoluchowski  ^*  using  the  more 
fundamental  methods  of  investigating  the  actual  form  of  the  law  for  the 
distribution  of  the  velocities.  Smoluchowski  also  considers  the  possibility 
that  only  a  fraction  /  of  the  molecules  are  diffusely  reflected  from  the 
walls,  losing  on  the  average  the  component  of  velocity  ii,  the  remainder 
(1  —  /)  being  specularly  reflected.  In  such  a  case  the  rate  of  flow  would 
of  course  be  greater  than  given  by  (457),  and  this  is  found  to  be  in 
the  ratio  (2  —  /)//•  Equation  (457)  is  found,  however,  to  express  the 
facts  at  low  pressures  extremely  well.  At  higher  pressures  the  flow 
through  tubes  is  governed  by  Poiseuille's  law.  (See  Section  268.) 
Equations  (441)  and  (457)  are  sometimes  combined  in  the  form 

9.118X10=^       ,  ,     It  ,,-o- 

Q  -  2.394  L   /  3.184  ^^--^-Wm  ^'^^^ 

to  calculate  the  flow  through  a  tube  connecting  two  reservoirs  at  pres- 
sures p2  and  px,  allowing  both  for  the  resistance  at  the  entrance  to  the 
tube  and  the  resistance  of  the  tube  itself.  All  these  equations  hold  of 
course  only  at  low  enough  pressures  so  that  the  mean  free  path  is  large 
compared  with  the  diameter  of  the  tube.  Q  is  the  volume  discharged 
per  second  measured  at  temperature  T  and  at  unit  pressure  on  the  same 
scale  as  p2  and  pi. 

256.  Transfer  of  Momentum  at  Very  Low  Pressures  between 
Plates  at  Different  Temperatures. — Similar  methods  can  also  be  used 
to  investigate  the  behavior  at  low  pressures  of  molecules  between  parallel 
plates  at  different  temperatures  Ti  and  To.  We  may  consider  the  mole- 
cules between  the  plates  as  divided  into  two  groups  A^^i  per  cubic  centi- 
meter which  have  rebounded  from  the  hotter  plate  at  temperature  Ti 
and  are  moving  towards  the  colder  plate  at  T^,  and  A^2  moving  from  the 

"Knudsen,  Ann-,  dcr  Phys.,  28.  75  (1909). 
"Smoluchowski,  Ann.  der  Phys.,  33,  1559  (1910). 
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colder  plate  towards  the  hotter  plate.  If  we  assume  as  a  first  approxima- 
tion that  these  molecules  have  velocities  distributed  in  accordance  with 
Maxwell's  law,  with  average  velocities  respectively  Q^  and  Qo,  we  can  evi- 
dently write 

N^Q,  =  N,Q,  =  ^0  (459) 

where  A''  and  Q  are  the  number  of  molecules  per  cubic  centimeter  and 
average  velocity  for  the  gas  surrounding  the  plates,  equation  (459)  merely 
expressing  the  requirement  that  the  number  of  molecules  moving  from 
one  plate  to  the  other  must  be  balanced  by  the  number  returning,  and 
that  the  number  escaping  from  between  the  plates  must  also  be  balanced 
by  the  number  returning  from  the  neighboring  gas  if  we  are  to  maintain  a 
steady  state. 

For  the  pressure  between  the  plates  we  can  evidently  write  in  accord- 
ance with  equation  (105)   Chapter  5,  the  expression 

pint.  =  -Y  ^'^C*!-  -f  -^  mC.'  (460) 

and  for  the  pressure  outside  the  plates 

p  =  jmG'  (461) 

where  Gi,  G-.  and  G  are  the  mean  square  velocities  corresponding  to  the 
average  velocities  Q^,  Qo  and  Q.  Hence  we  shall  obtain  per  square 
centimeter  of  plate  surface  an  unbalanced  force  of  amount 

F  =  ~  mG^  +  ^  mG^  —  y  mG^  (462) 

tending  to  force  the  plates  apart,  or  substituting  equations  (459)  and 
(461),  this  can  easily  be  rewritten  in  the  form 

F^p[  ]^?L^_^_i   I  (463) 

1} 

In  accordance,  however,  with  our  assumption  of  the  Maxwellian  distribu- 
tion, we  should  have 

—  =  —=:-  (464) 

and  may  rewrite  (463)  in  the  form 
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^  =  ,(lG^_l)  (465) 

The  interpretation  of  equation  (465)  would  now  be  very  simple,  if 
the  molecules  rebounding  from  the  two  plates  at  Ti  and  T2  could  be  re- 
garded as  assuming  the  mean  square  velocities,  which  can  be  denoted  by 
the  primed  letters  Gi'  and  G^',  which  correspond  to  the  temperatures  Ti 
and  T2  in  accordance  with  the  equation 

%  =  ^  (466) 

Experiments  on  the  conduction  of  heat,  however,  have  shown  this  not  to 
be  the  case  but  that  the  molecules  rebounding  from  the  cold  plate  have 
not  dropped  to  as  low  a  velocity  as  would  be  required  and  vice  versa 
those  rebounding  from  the  hot  plate  have  not  completely  attained  the 
required  high  velocity.  Under  the  circumstances  Knudsen  has  assumed 
that  the  molecules  of  mean  square  velocity  Go  on  arriving  at  the  hot  plate 
acquire  only  a  fraction  a  of  the  required  velocity  increase  (G/  —  G2) 
and  those  reaching  the  cold  plate  lose  only  the  same  fraction  a  of  the 
required  decrease  (Gi  —  G2O.  giving  us  the  equations 

Gi  =  G2  +  a(G/  — G2).  (467) 

G2  =  Gi  — a(Gi  — G2O  (468) 

where  a  has  been  named  by  Knudsen  the  accommodation  coefficient. 
Combining  equations  (467)  and  (468)  we  obtain 

G,  +  G2  =  G/  +  G2'  (469) 

and  substituting  this  relation  together  with  (466)  into  (465)  we  can  obtain 
the  desired  expression  for  the  force  of  repulsion  per  square  centimeter 


=K^^^^*^-0 


(470) 


where  p  and  T  are  the  pressure  and  temperature  of  the  surrounding  gas, 
and  Ti  and  T2  the  temperatures  of  the  two  plates. 

This  equation  was  made  by  Knudsen  ^^  the  basis  of  his  "absolute  gage" 
for  measuring  the  pressure  of  highly  rarefied  gases. 

257.  Transfer  of  Energy  at  Very  Low  Pressures  between  Plates 
at  Different  Temperatures. — By  similar  methods,  we  can  also  investi- 
gate the  transfer  of  energy  by  a  highly  rarefied  gas  between  two  plates 
at  different  temperatures  T^  and  To.     We  may  again  treat  the  gas  be- 

"  Knudsen,  Ann.  der  Phys..  32,  809  (1910). 
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tween  the  two  plates  as  consisting  of  N-^  molecules  per  cubic  centimeter 
moving  from  the  hotter  to  the  cooler  plate  and  N2  per  cubic  centimeter 
moving  in  the  opposite  direction  and  vv^rite  the  equation 

N^Q^  =  NA=^^0  (471) 

vi^here  the  symbols  have  the  same  significance  as  in  the  last  section. 

For  the  average  time  taken  by  a  molecule  to  make  a  round  trip  between 
the  plates  we  shall  evidently  have  2d/Qx  -\-  Id/Q^.,  and  hence  for  the  num- 
ber of  round  trips  per  second  per  square  centimeter,  can  write 

,_     {N,-^N,)d    _{N,  +  N,)Q,Q, 


2d 


(i+i) 


and  substituting  equation   (471)   can  obtain 

Z  =  ^Q  (473) 

just  as  in  equation   (450)   for  the  case  where  the  two  plates  are  at  the 
same  temperature. 

Let  us  consider  now  first  the  internal  energy  which  the  molecules 
transport  from  the  hotter  plate  to  the  cooler  plate.  On  the  average  this 
will  evidently  stand  in  the  ratio  {Cv —  2)/2R)/{Z/2R)  to  the  kinetic  energy 
of  the  molecules,  where  C^  is  the  molal  specific  heat  at  constant  volume. 
Hence  combining  with  equation  (473)  we  may  write  for  the  rate  of  trans- 
fer of  internal  energy  per  square  centimeter  per  second 

3 

Q,,,  =  '^Q        3  ~  m{G^  -  G,^)  (474) 

2^ 

Turning  now  to  the  kinetic  energy  of  translation  of  the  molecules,  we 
cannot  calculate  this  directly  by  multiplying  the  number  of  transfers  by 
the  mean  kinetic  energy,  since  the  molecules  which  travel  across  in  the 
shortest  time  are  also  the  ones  which  carry  the  most  kinetic  energy.  A 
simple  computation  shows,  however,  that  this  rate  of  transfer  has  under 
our  assumptions  the  value 

Ou.n.  =  ^^Q\m{G,'-  G,-^)  (475) 

which  combined  with  equation  (474)  gives  us  for  the  total  rate  of  transfer 
of  energy  per  square  centimeter  per  second 
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The  quantities  Gi"  and  ^2^  occurring  in  this  formula  are  the  mean  square 
velocities  of  the  two  streams  of  gas  which  we  have  postulated  as  moving 
in  the  two  directions  between  the  plates  at  T^  and  To.  ExjDeriment  shows, 
however,  that  we  cannot  assume  these  quantities  proportional  to  Ti  and  T^, 
since  complete  temperature  equalization  does  not  take  place  on  reflection. 
If  we  again  make  the  assumption  of  Knudsen  as  given  by  the  former 
equations  (467)  and  (468)  we  can  obtain 

G,^  —  G^  =  —^—  (Gi'^  —  G/2)  (477) 

Z  —  a 

where  a  is  Knudsen's  accommodation  coefficient  and  G/^  and  Go'"  are  the 
mean  square  velocities  that  actually  correspond  to  the  temperatures  Ti 
and  Ts.  Making  the  substitution  in  (476)  and  inserting  the  known 
relations  between  p,  Q,  G  and  T  given  by  the  gas  law  and  equations  (95) 
and  (96),  Qiapter  5,  we  obtain 


The  derivation  of  this  formula  is  due  to  Knudsen.^**  It  suffers,  as 
did  the  work  in  the  last  section,  from  uncertainty  as  to  the  best  method 
of  allowing  for  the  lack  of  equalization  of  temperature  when  molecules 
are  reflected.  The  justification  for  the  particular  form  of  the  equations 
(467)  and  (468),  where  the  accommodation  coefficient  was  first  intro- 
duced, does  not  seem  certain.  The  equations  are  certainly  valuable,  how- 
ever, in  giving  an  idea  of  the  nature  of  the  phenomena  to  be  expected. 
Experiment  shows  that  the  accommodation  coefficient  for  a  given  sur- 
face and  gas  is  relatively  independent  of  the  temperature.  It  has  a 
higher  value,  as  would  be  expected,  for  a  platinized  than  for  a  smooth 
platinum  surface.  When  the  two  surfaces  are  of  different  materials  or 
are  not  parallel,  the  theory  must  be  modified  along  evident  lines.  For  fur- 
ther details  Knudsen's  paper  (/.  c.)  must  be  consulted. 

258.  Transport  as  Determined  by  Mean  Free  Path. — In  the  case 
of  gases  at  higher  pressures  than  those  that  we  have  just  been  consider- 
ing, the  motion  of  the  molecules  engaged  in  transferring  matter,  energy 
or  momentum  will  no  longer  be  tenninated  in  general  by  collision  with 
the  walls  of  the  apparatus,  but  rather  by  collision  with  other  molecules 
of  the  gas,  and  to  handle  transport  problems  at  higher  pressures  we  must 
introduce  the  idea  of  the  mean  free  path  which  molecules  travel  between 

"Knudsen,  Ann.  dcr  Physik,  34,  593  (1911). 
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their  collisions.  Having  a  value  for  this  free  path,  we  can  then  calculate 
the  rate  at  which  any  superimposed  disturbance  is  carried  through  the 
gas  by  the  motion  of  the  molecules,  under  the  assumption  that  the 
velocities  and  free  paths  are  the  same  as  they  would  be  under  equilibrium 
conditions.  We  must  again  emphasize,  however,  that  such  calculations  are 
only  approximate  in  nature,  since  they  do  not  allow  for  the  efifect  of  the 
disturbance  itself  in  changing  the  Maxwell-Boltzmann  distribution. 

The  expressions  to  be  obtained  can  be  expected  to  have  validity 
only  in  a  range  of  pressure  high  enough  so  that  collisions  with  the  walls 
may  be  neglected  in  comparison  with  collisions  with  other  molecules,  but 
not  so  high  that  the  effect  of  triple  collisions  between  molecules  will  have 
to  be  taken  into  consideration. 

259.  Probability  for  a  Path  of  a  Given  Length. — Before  making 
any  calculations  as  to  the  dependence  of  free  path  on  size  or  velocities 
of  the  molecules,  we  may  first  consider  some  general  characteristics  of  the 
mean  free  path. 

Let  us  denote  by  /(/)  the  probability,  under  equilibrium  conditions 
that  a  molecule  of  a  given  class  (i.e.  having  a  given  shape,  size,  and 
velocity)  will  traverse  the  distance  /  without  making  a  collision.  For 
the  chance  that  it  will  traverse  the  longer  distance  /  +  dl,  we  shall  then 
have  f(l-\-dl).  We  may  also,  however,  set  the  chance  that  it  will 
make  a  collision  in  the  distance  dl  equal  to  adl  where  a  is  a  constant  of 
proportionality,  and  hence  the  chance  that  it  will  not  make  collision  in  the 
distance  I  -{-  dl  equal  to  the  product  of  the  two  probabilities  /(/),  that  it 
will  travel  the  distance  /  multiplied  by  the  probability  (1 — adl),  that  it 
will  travel  the  further  distance  dl.     Equating,  we  obtain 

f{l  +  dl)  =f{l){l  —  adl) 

or 

fil)+^dl  =  f{l)-af{l)dl 

which  gives  us  on  integration 

f{l)  =  e-'^'  (479) 

the  constant  of  integration  being  determined  by  the  fact  that  f{l)  must 
evidently  be  unity  when  /  itself  is  zero. 

To  interpret  the  constant  a  we  may  write  for  the  chance  of  collision 
within  the  distance  I 

I— f  (I)  =  1  —  0-"^  (480) 

or  by  differentiation  for  the  chance  of  collision  in  the  range  /  to  /  +  ^^ 

—  dfil)=a^^dl  (481) 
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The  mean  free  path  X  for  the  molecules  under  consideration,  however, 
will  evidently  be  given  by  integrating  over  all  possible  distances  /,  the 
product  of  /  times  the  chance  of  termination  in  the  particular  range  / 
to  /  +  dl,  giving  us 

X  = 


=  (ae-^^  Idl  (482) 


Performing  the  integration  and  substituting  the  limits  we  obtain 

X  =  -  (483) 

a 

Hence  for  the  probability  div  that  the  free  path  of  a  molecule  will  terminate 
in  any  particular  interval  dl  we  have 

dw  =  J  (484) 

and  referring  back  to  equations  (479)  and  (481),  we  may  write  for  the 
chance  of  traversing  a  path  /  without  collision 

f{l)=e-~i:  (485) 

and  for  the  chance  of  a  free  path  of  length  between  /  and  I  -{-  dl 

—  df{l)=e-^j  (486) 

•  We  note  at  once  that  the  probability  of  a  free  path  much  longer  than 
the  mean  is  very  small.  Thus  from  equation  (485)  the  chance  of  travel- 
ling a  path  twice  X  is  only  0.135,  and  the  chance  of  travelling  a  path 
five  times  X  only  0.0067. 

260.  Mean  Free  Path  and  Path  to  Go. — We  must  now  consider 
a  matter  of  interpretation  which  is  sometimes  puzzling.  The  quantity  X, 
which  was  introduced  above  and  called  the  mean  free  path  for  mole- 
cules of  the  class  in  question,  is  from  its  definition  by  equation  (482) 
evidently  the  free  path  that  any  chosen  molecule  still  has  to  travel  on 
the  average  after  it  has  been  picked  out.  Owing,  however,  to  the 
assumption  that  the  chance  of  collision  adl  is  the  same  for  any  range  dl, 
the  length  of  its  path  up  to  the  time  of  choice  has  no  effect  on  the 
probable  length  of  the  succeeding  path.  Hence,  since  we  can  if  desired 
pick  the  molecule  immediately  after  a  collision,  the  quantity  X  will  also  be 
the  mean  free  path  travelled  by  molecules  of  the  class  in  question  between 
collisions. 
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Furthermore,  it  must  be  noted  if  we  pick  molecules  of  the  class  in 
question  arbitrarily,  that  X  will  also  be  on  the  average  the  free  path 
which  they  have  travelled  siiKe  their  last  collision,  owing  to  the  fact  that 
the  chance  of  collision  in  any  previous  element  dl  also  had  the  value  adl. 

A  simple  example  due  to  Boltzmann  will  perhaps  make  the  condition 
of  affairs  clearer.  If  we  consider  the  throws  of  a  cubical  die,  the  average 
number  of  throws  between  two  throws  of  a  six  spot  will  evidently  be 
five.  In  addition,  however,  at  any  time  in  a  series  of  throws  the  average 
number  of  throws  until  we  get  a  six  spot  will  be  five  and  the  average 
number  since  we  got  a  six  spot  will  be  five. 

261.  Mean  Free  Path  of  Clausius. — The  quantity  X  introduced 
above  was  the  mean  free  path  for  molecules  of  a  particular  class.  If 
we  desire  now  an  average  for  molecules  of  all  the  classes  present,  we 
shall  obtain  different  expressions  for  the  value,  depending  on  the  way 
the  average  is  taken.  The  particular  expression  for  the  free  path  to 
be  chosen  for  subsequent  discussions  will  of  course  be  determined  by 
the  kind  of  average  that  is  demanded  by  the  problem  under  discussion. 

The  first  calculation  of  mean  free  path  is  due  to  Clausius,  and  was 
made  on  the  assumption  that  all  the  molecules  had  the  saine  velocity, 
distributed  in  random  directions.  Postulating  rigid  spherical  molecules 
the  value  thus  obtained  was 


0.75 


•  Clausius  — 


AnNo-        TiNo- 


(487) 


where  A^  is  the  number  of  molecules  per  cubic  centimeter  and  o  is  the 
diameter  of  a  molecule.  The  quantity  given  is  the  average  distance 
travelled  between  collisions  by  any  molecule  of  the  assumed  system. 

262,    Mean  Free  Path  for   Molecules   of  a   Given  Velocity. — In 

the  case  of  an  actual  gas  with  molecular  velocities  distributed  according 
to  Maxwell's  law,  the  free  path  for  molecules  of  higher  velocities  will 
be  somewhat  greater  than  for  those  of  lower  velocity,  and  the  problem 
is  much  more  complicated.  It  can  be  shown, ^'^  however,  that  for  mole- 
cules having  velocities  in  the  range  c  to  c  -\-  dc  the  mean  free  path  will 
be  given  by  the  expression 

2kT 
X^=  £^f_ ^^-^  (488) 


^No^w(c^^) 


"  Derivations  for  the  above  and  succeeding  expressions  for  mean  free  path  are 
best  found  in  Jeans  "Dynamical  Theory  of  Gases,"  Cambridge  University  Press. 
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where  ip{x)  has  the  form  ^^ 

x 

ip(x)  =  xe-^'  +  (2.r==  +  1)  Ce'y'dy  (489) 

0 

263.  Maxwell's  Mean  Free  Path. — For  a  system  having  the  equi- 
librium distribution  of  velocities^  the  mean  free  path  for  all  the  mole- 
cules of  the  system  was  defined  by  Maxwell  as  the  total  distance  travelled 
in  one  second  by  all  the  molecules  present  divided  by  the  total  number 
of  free  paths  described  in  that  time. 

For  the  total  distance  L  travelled  by  the  molecules  of  the  system  we 
may  take  the  product  of  the  number  of  molecules  N  and  their  average 
velocity  Q  giving  us 

L  =  NQ  =  N  V-^  (490) 

To  calculate  the  total  number  of  free  paths  described  per  second,  we 
note  from  equation  (484)  that  the  chance  of  collision  for  molecules 
having  velocities  between  c  and  c  -\-  dc  is  //Ac  per  centimeter  length  of 
path,  or  c/Xc  per  second.  Furthermore,  each  collision  terminates  two 
mean  free  paths.  Hence  for  the  total  number  of  mean  free  paths  described 
per  second  we  may  take 


Z 


:=2JjdNc  (491) 


where  dNc  is  the  number  of  molecules  having  velocities  in  the  range  c 
to  c  +  dc. 

Substituting  for  Xc  the  value  given  by  equations  (488)  and  (489) 
and  for  dNc  the  value  given  by  equation  (92)  in  Chapter  5,  it  is  found 
possible  to  integrate  equation    (491)    and  obtain  the  result  ^'^ 

Z  =z  4N'o-  V^^  =  V2  7i  No^n  (492) 

Dividing  equation  (490)  by  (492)  we  obtain  for  Maxwell's  value  of 
the  mean  free  path 

"  Tables  for  the  value  of  this  function  were  calculated  by  Tait,  Edin.  Trans.,  33, 
74  (1886)  and  may  be  found  in  Jeans  (I.  c). 

If  c  is  one  half  the  average  velocity  Xc/'k  Maxwell  drops  to  0.6411,  while  if  c  is 
infinity  'kc/'K  Maxwell  rises  only  to  1.4142.  Hence  it  is  evident  that  the  mean 
free  paths  are  closely  concentrated  for  molecules  of  all  speeds. 

"  This  is  of  course  twice  the  expression  we  have  obtained  in  Chapter  5,  equation 
(121),  for  the  total  number  of  collisions  per  second.  The  calculation  is  given  in  the 
present  form  to  show  the  analogy  with  the  following  calculations  of  other  types 
of  mean  free  paths. 
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1  0.707 


Maxwell 


V2  71  No-       ^^0' 


(493) 


In  the  case  of  spherical  molecules  of  more  than  one  kind,  Maxwell's 
value  for  the  mean  free  path  for  the  molecules  of  a  particular  kind  of 
mass  uti  assumes  the  form 

^  = \ (494) 


n^N^ 


VI    I    ■^i 
1  H 


where  Ojs  is  the  average  diameter  of  molecules  of  kind  1  and  kind  s, 
and  the  summation  is  to  be  taken  for  all  kinds  of  molecules  present. 

264,  Tait's  Mean  Free  Path.^ — Again  considering  a  system  of  mole- 
cules with  the  Maxwell  distribution  of  velocities,  we  have  a  definition 
due  to  Tait  of  rather  clearer  physical  significance  than  the  foregoing. 
Tait  takes  the  mean  free  path  as  the  average  value  of  Ac  for  the  mole- 
cules of  all  the  d.ifferent  velocities  present.  Hence  if  we  consider  a 
given  instant  of  time,  Tait's  mean  free  path  will  be  the  average  distance 
travelled  by  all  the  molecules  before  their  next  collision. 

In  accordance  with  the  definition  we  may  write 

A  Tait  =jjCUNc  (495) 

0 

On  substituting  the  value  of  Ac  given  by  equations  (488)  and  (489)  and 
the  value  of  dNc  given  by  equation  (92)  in  Chapter  5,  an  expression  is 
obtained  which  must  be  integrated  by  quadrature.  This  has  been  done 
both  by  Tait  and  Boltzmann  and  leads  to  the  value 

^-=^  (496) 

265.  Mean  Free  Path  for  Transport  Problems. — Finally  we  may 
define  a  special  value  of  the  mean  free  path,  which  will  be  useful  in 
interpreting  the  transport  of  disturbances,  as  the  average  value  of  IcC 
divided  by  the  average  velocity  Q  in  accordance  with  the  equation 


CCj 

t  =  ^  ^J^ccdNc  (497) 


"Transpor 

Substituting  the  values  of  Ac  and  dNc,  this  also  leads  to  an  expression 
which  has  been  evaluated  by  Boltzmann  and  Tait  by  quadrature,  leading 
to  the  value 
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1  0.743  ,,^_, 

/Transport    —  ^^y^.  (4^0) 

This  is  the  last  of  the  methods  of  defining  the  mean  free  path  in  a 
system  of  molecules  that  we  shall  consider.  Other  definitions  would  of 
course  give  other  values  which  would  sometimes  be  of  interest.  Owing 
to  the  fact  that  Xc  does  not  vary  rapidly  with  the  velocity  c,  different 
methods  of  defining  the  mean  free  path  lead  to  results  which  do  not 
differ  greatly  as  will  be  seen  by  comparing  equations  (487),  (493),  (496), 
and   (498). 

266.  General  Equation  for  the  Transport  of  a  Disturbance  as 
Dependent  on  Mean  Free  Path. — We  may  now  use  the  foregoing  in- 
formation concerning  mean  free  paths  to  investigate  the  transport  of 
matter,  energy  and  momentum,  through  gases,  which  are  held  at  pres- 
sures such  that  the  free  motions  of  the  molecules  are  in  general  terminated 
by  simple  collisions  between  pairs  of  molecules. 

Consider  a  gas  sensibly  in  a  state  of  equilibrium  but  with  a  super- 
imposed disturbance  which  is  being  transported  in  the  direction  of  the 
s  axis  owing  to  a  variation  in  the  intensity  of  the  disturbance  as  we 
proceed  along  that  axis.  This  variation  makes  the  molecules  coming  from 
one  direction  carry  a  greater  amount  of  the  disturbance  than  those  coming 
from  the  opposite  direction  and  thus  leads  to  the  transfer  of  the  disturbance 
through  the  gas. 

To  calculate  the  rate  of  transport  of  the  disturbance  let  us  consider  a 
plane,  perpendicular  to  the  direction  of  flow  located  at  the  point  s  =  Zo, 
and  investigate  the  motion  of  the  molecules  from  the  two  sides  through 
this  plane.  We  shall  first  fix  our  attention  on  molecules  having  velocities 
in  the  range  c  to  c  -\-  dc  which  pass  through  this  plane  at  an  angle  to 
the  normal  between  ■&  and  ■&  -\-  d^.  The  number  so  crossing  per  second, 
from  one  side  to  the  other  of  the  plane,  will  be  proportional  to  the  solid 
angle  27i  sin§d&  in  which  the  direction  of  motion  must  lie,  proportional 
to  their  component  of  velocity  c  cos?^  parallel  to  the  z  axis,  and  propor- 
tional to  the  number  dNc  per  cubic  centimeter  which  have  the  velocity 
under  consideration,  so  that  we  may  write  for  the  number  of  such  cross- 
ings per  second  per  square  centimeter  of  the  surface  the  quantity 

,„       2nsin-&dd'  „   ,,.        c  sin  i?  cos  i9'  ,,^   ,„  /  Ar^n\ 

dZ  = -. c  cos  ^  dNc  = p^ dNcd&  (499) 

471  2 

If  now  Xc  is  the  mean  free  path  for  molecules  of  velocity  c,  the  mole- 
cules under  consideration  will  on  the  average  have  made  their  last  col- 
lision at  a  distance  -°  Xc  cos  §  from  the  plane  s  =  Zq.     Hence,  if  we  denote 

*•  See  Section  260. 
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by  G(2)   the  average  value  of  the  disturbance  per  molecule  for  gas  in 

the    layer    s  =  s,    the    molecules    denoted    above    may    be    assumed    to 

dG 
bring  with  them  the  quantity  of  disturbance  G(2o)  +  Ac  cos  '&-^  if  they 

come  from  the  direction  of  larger  values  of  s,  and  the  quantity 
G(^'o)  — Ac  cos  V  -^,  if  they  come  from  the  opposite  direction.  Com- 
bining with  equation  (499)  for  the  number  of  crossings  from  each  side, 
we  obtain  for  the  net  rate  of  transport  of  the  disturbance,  in  the  direction 
of  decreasing  values  of  G,  due  to  the  molecules  under  consideration 

dTe,  c  =  Acc  sin  ^  cos-  §  -^  dNcd&  (500) 

and  integrating  for  all  possible  values  of  -&  from  0  to  7i/2,  we  easily 
obtain 

drc  =  ^dNc^  (501) 

for  the  transport  produced  by  molecules  having  velocities  in  the  range  c 
to  c  -|-  dc. 

Finally  to  obtain  the  total  rate  of  transport  for  molecules  of  all 
velocities  c  =  0  to  co,  we  must  substitute  the  value  of  Ac  given  by  equa- 
tion (488)  in  Section  262,  and  for  dNc  the  value  given  by  equation  (92) 
in  Chapter  5  and  integrate  from  zero  to  infinity.  From  the  definition  of 
A  Transport  given  in  the  preceding  section,  however,  it  is  evident  that  this 
will  give  us  for  the  total  rate  of  transport  per  square  centimeter  per 
second 

0 

and  for  rigid  spherical  molecules  of  diameter  o  this  gives  us  in  accordance 
with  equation  (498) 

^       0.743 


3710" 


r,/()G\       115    fT/dG\  ,-^,, 


It  should  be  specially  noticed  that  the  rate  of  transport  at  these  higher 
pressures  is  independent  of  the  density  of  the  gas  at  constant  temperature. 
This  seemingly  surprising  result,  which  agrees  with  experiment,  was  re- 
garded as  a  great  triumph  of  the  older  kinetic  theory.  The  result  is 
due  of  course  to  the  fact  that  although  an  increase  in  density  furnishes 
more  molecules  to  transport  the  disturbance,  it  decreases  in  the  same  ratio 
the  free  path  which  the  molecules  travel  without  collision.     At  very  low 
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pressumes,  as  we  have  already  seen,  where  the  motions  of  the  molecules 
are  terminated  by  the  walls  of  the  apparatus,  rates  of  transport  become 
proportional  to  the  pressure. 

267.  The  Transport  of  Momentum  (Viscosity). — Equation  (502) 
may  now  be  used  to  investigate  the  transport  of  momentum  through  a 
gas  and  thus  to  obtain  an  expression  for  its  viscosity. 

Consider  a  gas  in  which  the  mass  velocity  u  in  the  x  direction  varies 
from  point  to  point  as  we  go  along  the  z  axis.  Then  for  the  average 
amount  of  directed  momentum  i^er  molecule  in  any  layer  perpendicular 
to  the  s  axis  we  may  write 

G  =  mu  (504) 

and  substituting  in  equation  (502),  obtain  for  the  rate  of  transport  of 
momentum  per  square  centimeter  across  a  given  surface  perpendicular 
to  the  s  axis 

r  =  ;= A  Transport   I   "^  )  (505) 

The  rate  of  transport  of  momentum  per  square  centimeter  per  unit  velocity 
gradient  is,  however,  by  definition  the  viscosity  ?]  of  the  gas,  so  that  we 
obtain  for  the  viscosity  the  simple  expression 

V]  =  :, A  Transport  ^^  '^  Q  ^  ^Transport  (506) 

In  case  we  assume  rigid  spherical  molecules  of  diameter  o,  by  substitut- 
ing equation  (498)  for  A  Transport  and  equation  (95),  Chapter  5,  for  Q  we 
can  obtain  for  the  viscosity  in  c.g.s.  units 

r)  =  1.89  X  10-^^^^^^  (507) 

o 

so  that  a  measurement  of  viscosity  leads  to  an  experimental  method  of 
determining  the  diameters  of  molecules. 

268.  Flow  of  Gases  at  Moderate  Pressures  through  a  Tube. — 

The  great  difference  in  the  mechanism  by  which  momentum  is  transferred 
through  gases  at  high  and  low  pressures  produces  a  corresponamg  differ- 
ence in  the  formulse  for  flow  through  tubes.  Thus  at  moderate  pres- 
sures we  find  the  flow  in  cubic  centimeters  measured  at  unit  pressure 
governed  by  Poisseuille's  law 

G  =  2qi^>  (508) 


266-269]  TRANSPORT  PROBLEMS  225 

instead  of  by  equation  (457)  in  Section  255,  valid  at  very  low  pressures, 
which  makes  the  flow  proportional  to  the  cube  rather  than  the  fourth 
power  of  the  diameter.  For  intermediate  pressures,  empirical  laws  such 
as  given  by  Knudsen  (/.  c.)  must  be  employed. 

269.    The   Transport   of   Energy    (Thermal    Conductivity). — We 

may  next  use  equation  (502)  to  investigate  the  transport  of  energy 
through  a  gas  and  thus  to  obtain  an  expression  for  the  thermal  con- 
ductivity of  the  gas. 

Consider  a  gas  in  which  the  temperature  T  varies  as  we  proceed 
along  the  z  axis.  For  the  average  energy  per  molecule  in  any  layer 
perpendicular  to  the  z  axis,  we  may  then  write 

G  =  ^T  (509) 

where  Cv  is  the  heat  capacity  per  mol  at  constant  volume,  and  A  is 
Avogadro's  number. 

Substituting  in  equation  (502),  we  obtain  for  the  rate  of  trans- 
port of  energy  per  square  centimeter  across  a  given  surface  perpendicular 
to  the  z  axis 

j._NQ  C.(dT\ 

J.     —  "y^ Transport  ^  1-^1  K^ ^^ ) 

The  coefficient  of  the  temperature  gradient  (dT/dz)  is,  however,  by 
definition  the  thermal  conductivity  of  the  gas  x,  giving  us 

_Nn  C„  .-,.. 

^  —  ~^T~  •^  Transport  "~T~  W  -l  •■■  J 

or  combining  with  (506)  and  noting  that  Cv/Aui  is  evidently  the  specific 
heat  Cv  we  obtain 

X  =  fjCv  (512) 

a  relation  between  viscosity  and  thermal  conductivity  which  formed  one 
of  the  early  triumphs  of  the  kinetic  theory. 

The  derivation  just  given  suffers  of  course  from  the  general  error 
introduced,  as  emphasized  in  Sections  248  and  258,  by  neglecting  the 
effect  of  the  superimposed  disturbance  in  altering  the  distribution  from 
the  equilibrium  Maxwell-Boltzmann  law.  In  addition,  moreover,  the 
present  derivation  makes  no  allowance  for  the  fact  that  the  fastest  moving 
molecules  are  the  ones  that  bring  the  most  translational  energy,  which 
will  tend  to  make  the  value  of  x  larger  than  that  given  by  equation 
(512).  In  view  of  other  uncertainties,  however,  it  does  not  seem  im- 
portant to  correct  the  calculation  so  as  to  allow  for  this.    Experimentally, 
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the  right-hand  side  has  to  be  modified  by  introducing  a  factor  which 
varies  from  1.5  to  2.5  for  different  gases.  The  more  complex  the  mole- 
cule under  consideration,  the  smaller  the  value  of  this  experimental  fac- 
tor is  found  to  be,  agreeing  with  the  natural  hypothesis  that  the  internal 
energy  is  transported  more  nearly  as  required  by  equation  (510).  Using 
the  more  refined  methods  previously  mentioned,  Chapman  {I.  c.)  finds  for 
elastic  spheres  the  multiplier  2.522  and  this  is  not  far  from  the  value  ex- 
perimentally found  for  monatomic  gases. 

270.  The  Transport  of  Matter  (Diffusion). — A  satisfactory  in- 
vestigation of  the  diffusion  of  gases  into  each  Other  by  elementary 
methods  using  the  concept  of  mean  free  paths  is  difificult.  The  following 
relatively  simple  treatment  which  leads  to  the  reasonably  good  formula 
of  the  Stefan-Maxwell  theory  is  perhaps  as  elaborate  as  the  inherent 
weaknesses  of  the  method  would  justify. 

Consider  a  gaseous  system  containing 

N  =  N^-\-N2  (513) 

molecules  per  cubic  centimeter,  Nx  of  the  first  kind  and  A^2  of  the  second 
kind.  We  shall  assume  that  the  proportions  A^i  and  A^2  vary  as  we  pro- 
ceed along  the  z  axis,  thus  leading  to  the  process  of  diffusion.  By 
Avogadro's  principle,  however,  since  the  pressure  throughout  the  system 
is  constant,  the  total  number  A^  per  cubic  centimeter  will  be  constant, 
which  leads  by  differentiation  of  equation  (513)  to  the  result 


dN^__dNj 
dz  dz 


(514) 


For  simplicity  we  shall  consider  all  the  molecules  of  each  kind  as 
having  identical  velocities  Q^  and  Q2  distributed  in  random  directions, 
and  in  addition  a  component  Wq  along  the  z  axis  which  corresponds  to 
any  mass  flow  in  that  direction  that  may  result  from  the  diffusion  process. 
The  quantities  Qi  and  Q^  will  be  taken  as  the  average  thermal  velocities 
corresponding  to  the  temperatures  and  the  quantity  zvq  will  be  determined 
in  the  course  of  the  demonstration.  The  free  paths  of  the  two  kinds  of 
molecules  will  be  denoted  by  the  symbols  Xx  and  /I2. 

Consider  now  a  plane  perpendicular  to  the  direction  of  flow  at  the 
point  z  =^  Zq  moving  itself  with  the  velocity  Wq  and  let  us  fix  our  atten- 
tion on  molecules  of  the  first  kind  which  pass  through  this  plane  at  an 
angle  to  the  normal  between  d^  and  d^  -{-  <!■&.  The  number  so  crossing  per 
second,  from  one  side  of  the  plane  to  the  other,  will  be  proportional  to 
the  solid  angle  2n  s,m^d§  in  which  the  direction  of  motion  must  lie,  pro- 
portional to  their  component  velocity  Qx  cos  d'  parallel  to  the  z  axis  and 
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to  the  number  of  molecules  per  cubic  centimeter, A/*!  zh  Xx  cos  •&  -^ — ,  at  the 

average  distance  dr  Ai  cos  •&,  from  which  they  have  come  since  their  last 
collision.  So  that  we  may  write  for  the  net  passage  of  molecules  per 
second  per  square  centimeter  in  the  direction  of  increasing  values  of  2 

dZx  = -7 ih  cosv  2/i  cost/  -^ —  (515) 

which  on  integration  over  all  possible  values  of  ^  from  0  to  7i/2  easily 
gives  us 

Z,  =  -\q,X,^  (516) 

This  is  the  number  of  molecules  crossing  a  plane  which  is  itself  moving 
with  the  velocity  Wq  in  the  z  direction,  i.e.  which  has  the  mass  velocity 
of  the  gas.  Hence  the  total  number  crossing  a  stationary  plane  in  the 
positive  direction  per  square  centimeter  per  second  will  be 

Z/  =  iV.Wo  —  \  Oxh  ^  (517) 

Similarly  the  number  of  molecules  of  the  second  kind  crossing  the  plane 
in  the  same  direction  will  be 

1  dNo 

Z2'  =  A^.wo  —  ^  ^2^0  -^  (518) 

Since  in  accordance  with  equation  (513),  however,  the  total  number  of 
molecules  on  each  side  of  the  plane  must  be  constant,  these  two  quantities 
must  be  equal  and  opposite  in  sign,  permitting  us  to  obtain  for  Wq  the 
value  ^^ 

dz  dz  /cin\ 

'""  =  — m.-^N,) —  (51^^ 

and  substituting  this  value  in  equations  (517)  and  (518),  noting  also  the 
relation  (514),  we  obtain 

,  _       N,Q,X,  4-  NoQ.X,  dNx  .  -^_ - 


and 


constancy 


3(iV,  +  A^,)       dz  -"]  -  ^^^^^ 

rnor 

The  quantity  Wo  may  be  regarded  as  the  mass  velocity  necei        f    r  ireserve  the 
:ancy  oi  N  =  N^-\-  N2.  O* 
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By  definition,  however,  the  coefficient  for  the  concentration  gradient  in 
the  expression  for  the  rate  of  diffusion  is  the  diffusion  coefficient  D 
for  the  substance  in  question,  giving  us 

the  same  for  both  gases  in  the  mixture. 

To  interpret  equation  (522)  we  must  next  inquire  what  vakies  to  use 
for  the  mean  free  paths  Ix  and  l-,-  For  a  mixture  of  two  gases.  Maxwell's 
value  for  the  mean  free  path  takes,  in  accordance  with  equation  (494), 
the  form 

/li  = ^ , (523) 

On  examination  this  expression  shows  that  the  free  path  is  limited  both 
by  the  concentration  A^^^  of  the  molecules  of  the  first  kind  present  in 
the  mixture  and  by  the  concentration  No  of  molecules  of  the  second  kind, 
corresponding  to  the  two  terms  in  the  denominator.  It  seems  hardly 
reasonable,  however,  to  suppose  that  the  rate  of  diffusion  of  the  mole- 
cules of  a  given  kind  will  be  hindered  by  the  collision  of  molecules  of 
this  kind  among  themselves,  since  the  total  momentum  of  the  molecules 
in  the  s  direction  will  not  be  changed  by  such  collisions.  Hence  follow- 
ing the  suggestion  of  Stefan  and  Maxwell  we  may  use  for  Ai  the  value 
it  would  have  if  N-i  were  zero,  and  similarly  for  X^  the  value  obtained 
with  A^2  equal  to  zero.  Substituting  these  values  in  equation  (522),  we 
easily  obtain  as  our  final  expression  for  the  coefficient  of  diffusion 

_  Vm^Qj  +  VnhOi  C524) 

37i{Ni  -\-  N2)oi2^  V  mi-i-m. 

or  introducing  the  proportionality  Qx/^2   =  mn/nh,  we  can  write 

^-  3n(N,-^N,)o,.^  ^^^^^ 

The  deduction  of  this  result  has  been  plausible  rather  than  certain 
in  character.  Nevertheless  the  formula  agrees  reasonably  well  with  ex- 
perimental data,  and  considerably  better  than  it  would  if  we  substituted 
the  full  expression  for  Ai  and  X2  into  equation  (522). 


271.    The^Transport  o£  Electricity  (Electrical  Conductivity). — 

The  phenom(  7^    of  electrical  conduction  in  material  systems  is  due  to 
the  presence  ^^^  electrically  charged  particles,  electrons  or   ions,   which 
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move  under  the  influence  of  the  apphed  electromotive  force,  thus  lead- 
ing to  the  transport  of  electricity  in  the  direction  of  decreasing  potential. 
The  magnitude  of  the  electrical  conductivity  of  a  given  substance  can 
be  quantitatively  accounted  for,  in  a  somewhat  formal  manner,  in  terms 
of  the  concentrations  and  charges  of  the  various  ions  present  and  their 
mobilities. 

Consider  a  system  containing  per  cubic  centimeter  N-^,  No,  N^,  etc., 
ions  carrying  the  charges  v^e,  VoC,  v^e,  etc.,  which  are  multiples  of  the 
electronic  charge  e.  For  the  case  of  a  neutral  system  we  must  evidently 
have  the  total  charge  equal  to  zero,  giving  us 

(A^in  +  A^o^o  +  A^3»'3  +  .  ■)  =  0  (526) 

but  our  considerations  do  not  need  to  be  limited  to  this  case. 

Let  the  mobilities  of  the  various  ions  present,  that  is  the  average 
velocities  which  they  attain  in  the  direction  of  the  field  under  unit  potential 
gradient,  be  represented  by  U^,  U2,  U^,  etc.  It  is  then  evident  that  the 
current  flowing  per  square  centimeter  under  unit  potential  gradient  will  be 

/  rr:  N^v^eU^  +  N.v.eU^  +  N^v.eU^  +  . . .  (527) 

where  it  should  be  noted  that  positive  values  of  v  will  evidently  be 
associated  with  positive  values  of  U  and  vice  versa.  Since  the  current 
flowing  per  square  centimeter  under  unit  potential  gradient  is  by  defini- 
tion the  specific  conductivity  of  the  material  in  question,  we  have  thus 
obtained  an  expression  for  that  quantity. 

The  condition  for  Ohm's  law  to  hold  will  be  a  constancy  in  the  num- 
ber of  ions  present  per  cubic  centimeter,  and  a  velocity  proportional  to 
the  potential  gradient,  that  is  the  same  mobility  for  different  gradients. 
In  the  case  of  gaseous  conductors  where  the  number  of  ions  can  be 
increased  at  high  potential  gradients  by  ionization  through  collision,  we 
have  of  course  marked  deviations  from  Ohm's  law.  In  the  case  of 
electrolytic  solutions,  however,  where  the  ionic  velocities  are  small  and 
the  retarding  force  may  be  regarded  as  of  a  quasifrictional  character, 
so  far  no  deviations  from^  Ohm's  law  have  been  observed.  In  the  case 
of  metals,  with  very  high  potential  gradients  corresponding  to  current 
densities  of  5  X  10^  amperes  per  square  centimeter,  increases  in  resistance 
of  the  order  of  one  per  cent  have  been  found  in  gold  and  silver  by 
Bridgman.^^ 

272.    Relation  between  Mobility  and  Coefficient  of  Diffusion. — 

A  simple  relation  between  the  mobility  of  an  ion  and  its  diffusion  co- 
efficient  can   be   derived    by   elementary   methods    which    have    sufficient 

""Bridgman,  Phys.  Rev.,  19,  387  (1922). 
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plausibility  to  justify  their  application  in  view  of  the  complexity  and  un- 
certainty of  more  rigorous  treatment. 

Let  Z  be  the  number  of  molecules  crossing  a  given  surface  per  square 
centimeter  per  second  under  the  influence  of  a  concentration  gradient 
dN/ds,  and  connected  with  this  gradient  and  the  diffusion  coefificient  D  by 
the  equation 

ON 
Z  =  -D^  (528) 

Let  us  now  regard  this  process  of  diffusion  as  due  to  the  force  arising 
from  the  gradient  in  the  partial  pressure  of  the  component  in  question. 
Substituting  the  expression  for  pressure 

p  =  NkT 

we  can  write  equation  (528)  in  the  form 

The  quantity  (dp/ds)ds  can,  however,  be  regarded  as  the  directed  force 
acting  on  the  molecules  in  a  layer  of  area  one  square  centimeter  and 
thickness  da,  so  that  we  can  write  for  the  directed  force  per  molecule 

and  substituting  in  equation  (529)  obtain 

ND 
Z  =  -^F  (531) 

which  connects  the  number  of  molecules  crossing  the  surface  with  the 
force  acting  on  a  molecule. 

On  the  other  hand  in  the  case  of  an  applied  electromotive  force  the 
number  of  ions  crossing  a  surface  of  one  square  centimeter  per  second 
will  be  given  by  the  expression 

dE 
Z  =  —  NU^  (532) 

where  'N  is  the  number  per  cubic  centimeter,  U  their  mobility  and 
{dE/ds)  the  potential  gradient.  The  directed  force  acting  on  each  ion, 
however,  is 


272-273]  TRANSPORT  PROBLEMS  231 

so  that  equation  (532)  can  be  rewritten  in  the  form 

-  "^  "  (534) 


ve 

Identifying  now  the  coefficients  of  the  force  F  in  equations  (531) 
and  (534)  we  obtain  the  desired  relation  between  mobility  and  diffusion 
coefficient  in  the  form 

D=—kT  (535) 

ve 

273,  Diffusion  of  Electrolytes. — If  we  assume  that  the  osmotic 
pressure  (diffusion  pressure)  in  solutions  is  also  given  by  the  relation 
p  =  NkT,  equation  (535)  should  also  hold  in  electrolytic  solutions  where 
it  can  easily  be  tested.  The  possibility  of  such  a  treatment  was  first 
appreciated  by  Nernst. 

As  an  example  let  us  consider  a  binary  electrolyte  completely  dis- 
sociated into  ions  with  charges  -(-  e  and  —  e  and  mobilities  U  and  —  V. 
Since  the  independent  diffusion  of  the  two  ions  at  different  rates  would 
lead  to  strong  electrostatic  forces,  the  ions  will  diffuse  at  the  same  rate, 
a  potential  gradient  (dE/ds)  being  set  up  by  the  attempt  of  the  faster 
ion  to  get  ahead  which  makes  them  actually  proceed  at  the  same  velocity, 
and  keeps  the  concentration  A^  of  the  two  ions  the  same  in  any  layer  of 
the  diffusing  column. 

Under  the  combined  action  of  diffusion  and  the  potential  gradient, 
we  may  now  evidently  write  for  the  number  of  positive  ions  crossing  unit 
area  per  second 

as  as 

and  for  the  number  of  negative  ions 

dN  dE 

as  as 

where  D^^  and  Dn  are  the  diffusion  coefficients.  These  two  rates,  as 
we  saw  above,  however,  must  be  equal,  so  that  by  substituting  equation 
(535),  we  can  write 

Z,  =  Z.  =  -^kT'.^-Nu'^  =  -^kT^  +  NF^ 
e  as  OS  e         as  as 


or  eliminating   (dE/ds) 
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Hence  the  diffusion  coefficient  for  the  salt  as  a  whole  has  the  value 

^  =  ;(^*^  (537) 

Since  ion  mobilities  can  be  determined  from  conductivity  measurements, 
this  relation  is  susceptible  of  easy  investigation,  and  has  been  satis- 
factorly  tested  over  a  wide  range  by  Oholm.^^  A  similar  equation  has 
been  derived  by  Noyes  ^*  for  the  general  case  of  an  electrolyte  giving 
more  than  two  ions. 

274.    Concluding  Remarks  on  the  Method  of  Mean  Free  Paths. — 

As  already  emphasized  in  several  places,  the  method  of  mean  free  paths 
suffers  from  a  logical  deficiency  in  treating  the  molecules  of  the  system 
under  consideration  as  though  they  had  the  equilibrium  Maxwell- 
Boltzmann  distribution  of  coordinates  and  momenta,  in  spite  of  the  fact 
that  the  system  is  not  in  equilibrium  but  is  in  a  disturbed  condition.  The 
method  has  the  great  advantage,  however,  of  relative  simplicity,  and  of 
sticking  close  to  the  qualitatively  correct  physical  picture  of  the  mecha- 
nism of  transport  by  molecular  motions.  In  addition  the  results  obtained 
are  usually  approximately  correct  and  almost  always  give  the  right 
functional  relationship  between  the  physical  quantities  involved  even 
though  experiment  shows  that  the  value  of  the  numerical  constants 
occurring  in  the  equations  must  be  altered. 

In  addition  to  the  purely  logical  fallacy  mentioned  above,  it  will  of 
course  be  appreciated  that  many  physical  hypotheses  have  to  be  made  in 
order  to  treat  actual  problems  by  this  or  any  other  method.  A  few 
remarks  with  respect  to  such  hypotheses  will  not  be  out  of  place. 

In  considering  the  problems  of  viscosity  and  thermal  conductivity,  it 
was  assumed  that  molecules  crossing  a  given  plane,  having  made  their 
last  collision  on  the  average  at  a  distance  A  cosi^  from  this  plane,  will 
bring  with  them  an  amount  of  the  disturbance  in  question  correspond- 
ing to  the  mean  value  of  the  disturbance  at  the  distance  A  cosi?^.  It  is 
evident,  however,  since  bodies  tend  to  continue  in  the  same  line  of  motion 
after  collision,  that  molecules  coming  from  collision  in  the  layer  at  X  cosi^ 
will  on  the  average  have  come  previous  to  this  collision  from  a  still 
greater  distance  from  the  plane  in  question.  This  tends  to  increase  the 
rate  of  transport  over  that  calculated  by  the  simple  theory. 

The  question  of  persistence  of  velocity  has  been  elaborately  treated 
by  Jeans.  For  rigid  spherical  molecules  of  the  same  mass  he  finds 
on  the  average  that  molecules  retain  about  forty  per  cent  of  their  velocity 
in  the  original   direction.      For  rigid  molecules   of   different   masses   the 

^'Oholm,  Zcits.  f.  phvsik.  Chan..  50.  309   (1904)  ;  70,  378  (1910). 
^See  Haskell,  Phys.'Rev.,  27,  145  (1908). 
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persistence  can  vary  from  zero  to  unity,  according  as  the  molecule  under 
consideration  is  very  light  or  heavy  compared  with  the  one  with  which  it 
collides.  For  non-rigid  molecules  the  definition  of  a  collision  becomes 
arbitrary  and  the  problem  of  persistence  loses  its  significance. 

With  the  help  of  his  calculations  of  persistence  of  velocity,  consider- 
able progress  has  been  made  by  Jeans  in  bettering  the  older  kinetic 
theory.  It  should  be  remarked,  however,  that  too  great  an  elaboration 
of  the  older  theory  does  not  prove  profitable  when  the  complications 
assume  a  difficulty  comparable  with  those  encountered  in  more  rigorous 
methods  of  treatment.  It  may  also  be  noted  in  passing  that  in  the 
Stefan-Maxwell  treatment  of  gas  difl^usion  given  in  the  present  text, 
the  effect  of  persistence  of  velocities  was  in  some  measure  allowed  for 
by  neglecting  collisions  between  molecules  of  the  same  kind. 

Another  physical  hypothesis  that  should  be  considered  is  the  assump- 
tion of  rigid  molecules  in  the  calculation  of  mean  free  paths.  This  of 
course  is  only  justifiable  as  a  first  approximation,  and  the  experimental 
fact  that  the  viscosity  of  gases  increases  more  rapidly  than  the  square 
root  of  the  temperature  T  as  given  in  equation  (507),  indicates  that 
we  must  regard  the  molecules  as  compressible,  with  lower  values  of 
the  effective  diameter  o  when  the  temperature  and  hence  average  rela- 
tive velocity  of  collision  is  greater.  Appreciable  success  in  this  direction 
has  been  achieved  by  Sutherland  ^^  by  taking  the  effective  diameter  as 
given  by  a  formula  of  the  form 

o'  =  o^'(^l-h^)  (538) 

Viscosity  measurements  at  moderate  temperatures  agree  well  with  this 
formula  but  it  fails  at  very  low  temperatures.^^ 

In  conclusion  it  should  be  pointed  out  that  cases  may  arise  where  the 
extent  of  our  physical  knowledge  does  not  permit  a  satisfactory  applica- 
tion of  the  method  of  mean  free  paths.  Thus  the  Drude  theory  of 
metallic  structure  which  assumes  the  electrons  in  a  metal  to  behave  like 
the  molecules  of  a  gas  must  be  abandoned  in  its  simple  form.  With  the 
help  of  this  hypothesis  Dinide  was  able  to  calculate  the  ratio  of  the 
thermal  to  the  electrical  conductivity  of  metals  by  taking  the  same  mean 
free  path  for  electrons  engaged  in  either  kind  of  conduction.  In  this 
way  he  obtained  a  value  of  the  ratio  at  ordinary  temperatures  which 
agreed  approximately  with  that  found  experimentally  by  Wiedemann  and 
Franz.  The  phenomenon  of  super  conductance  at  low  temperatures,  how- 
ever, indicates  that  a  considerable  change  in  theory,  perhaps  modified  by 
quantum  considerations,  must  be  made.^^ 

"=  Sutherland,  Phil.  Mag.,  36,  50  (1893). 

"  Kammerlingh  Onnes  and  Sophus  Weber,  Communications  Leiden  Phys.  Lab., 
134b,  p.  18. 

"See  Bridgman,  Phys.  Rev.,  17,  161    (1921). 


Chapter  20. 
The  Rate  of  Chemical  Reaction. 

275.  Introduction. — The  rate  at  which  chemical  reactions  proceed 
is  one  of  profound  interest  both  for  theory  and  practice.  With  the 
help  of  thermodynamics  it  is  now  possible  in  large  measure  for  chemists 
to  predict  in  advance  just  which  chemical  reactions  can  take  place,  namely 
those  accompanied  by  an  increase  in  entropy,  and  to  calculate  how  far 
they  will  go  before  equilibrium  is  attained.  The  laws  controlling  the 
specific  rates  at  which  these  reactions  proceed  are,  however,  beyond  the 
scope  of  thermodynamic  reasoning,  and  hence  the  thermodynamist  is  not 
in  a  position  to  foresee  what  will  actually  happen  when  chemical  mate- 
rials are  mixed  together,  since  his  knowledge  of  the  thermodynamically 
possible  reactions  does  not  suffice  for  the  prediction  of  the  actual  re- 
actions which  do  occur.  Indeed  by  a  strange  perversity,  it  often  seems 
as  if  the  reactions  which  are  accompanied  by  the  largest  increases  of 
entropy,  as  for  example  the  combination  of  hydrogen  and  oxygen  to 
form  water,  take  place  at  the  slowest  rates, ^  while  reactions  which  have 
almost  no  thermodynamic  tendency  to  proceed,  as  for  example  the  dis- 
sociation of  nitrogen  tetroxide  into  the  dioxide,  take  place  at  speeds  so 
great  that  they  still  defy  experimental  measurement. 

The  ultimate  solution  for  such  problems  of  reaction  velocity  cannot 
be  looked  for  in  the  field  of  thermodynamics  which  entirely  neglects  the 
atomic  and  molecular  structure  of  matter.  The  mechanism  of  chemical 
reaction  seems  certainly  determined  by  the  nature  of  the  interacting  atoms 
themselves.  In  the  case  of  the  individual  processes  which  occur,  the 
question  is  one  of  atomic  and  molecular  mechanics,  while  for  the  overall 
rate  of  change  in  collections  of  molecules,  the  problem  is  one  of   sta- 

^  Reactions  having  a  large  thermodynamic  tendency  to  proceed  are  in  general 
exothermic,  and  in  some  cases  the  slow  rate  of  reaction  may  be  due  to  the 
difficulty  of  providing  an  escape  for  the  excess  energy  from  the  molecule  formed. 
Thus  the  combination  of  monatomic  hydrogen  to  form  molecular  hydrogen  in  its 
normal  state 

H  +  H  =H2 

involves  the  emission  of  an  enormous  amount  of  energy,  and  if  we  assume  that  the 
non-polar  character  of  the  hydrogen  molecule  prevents  the  emission  of  this  energy 
in  the  form  of  radiation,  a  three-body  collision  would  be  necessary  for  the  reaction. 
This  hypothesis  is  in  agreement  with  the  slow  rate  of  recombination  of  monatomic 
hydrogen  and  its  catalysis  by  solid  substances  which  can  conduct  away  the  heat. 

234 
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tistical  mechanics.  In  the  three  following  chapters  we  shall  present  the 
progress  in  this  field,  in  some  ways  meagre  but  in  many  ways  significant, 
which  has  already  been  made  with  the  help  of  statistical  mechanics. 

276.  Classification  of  Chemical  Reactions. — Chemical  reactions 
may  for  convenience  be  classified  first  of  all  as  homogeneous,  heterogene- 
ous, and  explosive,  depending  on  the  spatial  location  of  the  actual  region 
in  the  system  where  the  chemical  changes  are  taking  place.  Secondly  as 
catalytic  or  non-catalytic,  depending  on  the  necessity  for  the  presence 
of  an  intermediate  substance  to  promote  the  change.  And  thirdly  as 
photochemical  or  thermal,  depending  on  the  presence  or  absence  of  radia- 
tion from  an  external  source. 

277.  In  a  homogeneous  reaction,  the  reacting  substances  are  all  pres- 
ent in  a  single  phase  and  the  chemical  change  proceeds  at  a  uniform  rate 
throughout  the  whole  mixture.  In  particular  no  catalytic  effect  of  the 
walls  of  the  apparatus  must  occur,  or,  if  reaction  does  proceed  more 
rapidly  at  the  walls,  correction. for  this  must  be  made  in  order  to  obtain 
the  true  homogeneous  rate. 

There  are  of  course  many  examples  of  homogeneous  chemical  change. 
The  reactions  most  familiar  of  all  to  the  chemist  are  the  changes  taking 
place  in  aqueous  solution,  and  these,  except  when  they  actually  involve 
steps  such  as  precipitation  or  solution,  are  for  the  most  part  homogeneous, 
since  rates  of  diffusion  to  the  walls  are  so  slow  that  contact  catalysis 
can  usually  play  no  appreciable  part  even  if  there  is  a  tendency  in  that 
direction.  Of  more  interest  for  statistical  mechanics,  however,  are  the 
very  considerable  number  of  homogeneous  gas  reactions  which  have  been 
investigated.  These  include  the  first-order  decompositions  of  nitrogen 
pentoxide  and  sulfuryl  chloride,  the  transformation  of  trimethylene  to 
propylene,  the  second-order  decompositions  of  ozone,  nitrous  oxide,  and 
chlorine  monoxide,  the  reaction  between  hydrogen  and  bromine,  and  a 
considerable  proportion  of  the  reaction  that  takes  place  between  hydrogen 
and  iodine  as  well  as  the  reverse  change  of  hydrogen  iodide  back  into  its 
elements. 

278.  In  a  heterogeneous  reaction,  the  actual  chemical  changes  take 
place  at  the  surface  of  discontinuity  between  two  phases.  This  may 
either  result  from  the  circumstance  that  the  substances  which  are  to  inter- 
act are  present  in  separate  phases  and  can  only  come  in  contact  at  the 
boundary  between  these  phases,  or  may  be  due  to  the  necessity  for 
the  catalytic  action  of  the  surface.  In  the  former  class  are  of  course 
many  of  the  most  important  reactions  of  industry,  including  the  combus- 
tion of  fuels  and  the  calcination  and  smelting  of  ores.     In  the  latter  class 
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are  the  increasingly  important  number  of  gas  reactions  which  are  acceler- 
ated by  contact  catalysis,  including  the  contact  process  for  the  manu- 
facture of  sulfuric  acid,  the  synthesis  of  ammonia  from  its  elements, 
the  oxidation  of  ammonia  for  the  manufacture  of  nitric  acid,  the  hydro- 
genation  of  hydrocarbons,  etc. 

279.  In  an  explosive  reaction,  the  chemical  changes  take  place  in  a 
reaction  zone  which  propagates  itself  through  an  originally  homoge- 
neous system.  The  simplest  examples  are  furnished  by  the  spread  of 
combustion  in  an  explosive  mixture  of  gases  where  the  luminosity  of 
the  reaction  zone  makes  it  possible  to  follow  its  motion  through  the  mix- 
ture using  photographic  methods.  It  is  of  course  apparent  that  the 
distinction  between  homogeneous  and  explosive  reactions  may  sometimes 
be  profitably  disregarded  by  considering  the  moving  reaction  zone  as 
consisting  of  a  set  of  infinitesimal  layers  of  mixture  of  varying  concen- 
tration and  temperature. 

280.  The  distinction  between  catalyzed  and  non-catalyzed  reactions  is 
one  of  great  practical  importance  but  not  of  very  deep  theoretical  sig- 
nificance. In  a  non-catalyzed  reaction  the  chemical  substances  enter 
directly  into  the  changes  which  lead  to  the  final  products.  In  a  cata- 
lyzed reaction,  however,  one  of  the  reacting  substances  must  first  inter- 
act with  the  catalyst  either  to  form  a  definite  "chemical"  compound  or  a 
looser  more  "physical"  aggregate  such  as  an  adsorbed  layer,  the  catalyst 
later  being  regenerated  in  the  course  of  the  further  changes.  The  sepa- 
rate steps  thus  occurring  in  a  catalyzed  reaction  are  themselves  chemical 
or  physical  processes  capable  of  individual  study.  As  pointed  out  by 
Perrin,  the  statement  sometimes  made  that  all  reactions  have  to  be 
catalyzed  is  a  meaningless  one,  since  the  reaction  between  catalyst  and 
reactant  would  then  have  to  be  catalyzed  by  a  second  catalyst,  and  the 
action  of  this  catalyst  promoted  by  a  third  catalyst  and  so  on  ad  infinitum. 

281.  Photochemical  reactions  are  to  be  distinguished  from  tJiermal 
reactions  by  the  circumstance  that  the  former  take  place  under  the  in- 
fluence of  radiation  from  an  external  source  which  is  not  in  tempera- 
ture equilibrium  with  the  reacting  mixture.  In  carrying  out  thermal 
reactions  on  the  other  hand,  the  enclosure  containing  the  reacting  sub- 
stances is  kept  in  surroundings  at  constant  temperature  which  can  supply 
or  remove  energy  in  the  form  of  radiation  as  well  as  by  conduction. 
The  attempt  is  thus  made  to  maintain  both  the  radiation  and  matter 
inside  the  reaction  vessel  in  a  condition  which  corresponds  as  nearly  as 
possible  to  temperature  equilibrium.  This  of  course  does  not  mean  that 
in  thermal  reactions  no  use  is  made  of  the  radiation  present,  nor,  on  the 
other  hand,  that  it  is  actually  possible  to  maintain  the  radiation  of  all 
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frequencies  at  the  densities  required  by  the  Planck  radiation  law.     These 
are  matters  which  must  be  specifically  investigated. 

282.  In  the  classification  of  chemical  reactions  it  should  finally  also 
be  noted  that  the  word  reaction  is  sometimes  used  to  signify  an  indi- 
vidual chemical  step  in  a  chain  or  collection  of  chemical  processes  and 
sometimes  to  signify  the  total  result  of  all  the  steps.  It  is  on  the  whole 
most  logical,  when  any  confusion  from  this  source  might  arise,  to  use 
the  word  reaction  as  applying  to  each  separate  step,  and  to  speak  of 
isolated  reactions  when  there  is  only  one  step  and  a  set  of  simultaneous 
reactions  when  several  steps  are  involved. 

283.  In  the  present  chapter  we  shall  discuss  thermal  reactions.  The 
material  presented  in  the  first  part  of  the  chapter  will  apply  to  homoge- 
neous reactions  and  mainly  to  gaseous  reactions,  since  they  are  the  ones 
most  amenable  to  statistical  mechanical  treatment.  A  brief  treatment 
of  heterogeneous  reactions  will  then  be  given  and  a  few  remarks  con- 
cerning explosive  reactions  will  also  be  included.  In  the  next  chapter 
we  shall  discuss  the  temperature  coefficient  of  reaction  rate,  and  in  the 
following  chapter  shall  treat  photochemical  reactions.  These  latter  are, 
as  a  matter  of  fact,  really  simpler  theoretically  than  thermal  reactions, 
since  the  mechanism  producing  the  changes  is  more  completely  specified 
and  controllable. 

284.  The  Order  of  Chemical  Reactions. — In  the  study  of  reaction 
velocities  a  careful  distinction  must  be  made  between  the  terms  order  of 
reaction  and  mechanism  of  reaction,  the  first  of  these  being  a  matter  of 
direct  experimental  measurement,  and  the  latter  usually  a  matter  of 
more  or  less  uncertain  theoretical  interpretation. 

The  order  of  a  reaction  with  respect  to  one  of  the  reacting  substances 
is  the  power  to  which  the  concentration  of  tbat  substance  enters  in  the 
expression  connecting  rate  of  reaction  with  concentration.  Thus  if  a 
reaction  between  substances  A,  B,  C,  etc.,  is  found  experimentally  at  a 
given  temperature  to  proceed  at  a  rate  which  can  be  expressed  by  the 
equation 

^i^^kC^-Cs'Cc"....  (539) 

the  reaction  is  said  to  be  of  the  order  x,  y,  a,  etc.,  with  respect  to  the  sub- 
stances A,  B,  C,  etc.,  where  C  J^,  C  b,  C  c,  etc.,  are  the  instantaneous 
concentrations  ^  of  the  reacting  substances,  and  ^  is  a  constant. 

*  It  is  often  convenient  and  customary  to  express  the  concentration  of  a  sub- 
stance by  its  chemical  formula  enclosed  in  brackets,  and  this  practice  will  be  em- 
ployed in  what  follows. 
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In  writing  this  equation  of  order  it  is  often  convenient  to  use  equiva- 
lent concentrations  in  expressing  the  quantities  C  ^,  C  b,  C  c,  etc.,  since 
then  the  rate  of  change  of  concentration  will  have  the  same  numerical 
value  for  the  different  substances  involved.  For  purposes  of  statistical 
mechanical  treatment,  however,  it  is  more  satisfactory  to  use  molal  con- 
centrations which  are  proportional  to  the  number  of  molecules  present, 
or  in  many  cases  to  express  the  concentrations  directly,  as  numbers  of 
molecules  per  cubic  centimeter,  since  the  formulae  of  the  kinetic  theory 
can  then  be  immediately  applied  to  calculate  the  number  of  collisions  be- 
tween molecules  that  are  taking  place. 

The  quantity  k  occurring  in  equation  (539)  is  called  the  specific  re- 
action rate,  and  has  the  dimensions  of  [^— iC-^+3'+"+ — — ^j.  It  is  most 
conveniently  expressed  using  time  in  seconds  and  concentrations  in  mols 
per  cubic  centimeter,  and  it  is  most  unfortunate  that  the  literature  con- 
tains so  much  work  in  which  inconvenient  or  even  unspecified  units  are 
employed. 

285.  Many  reactions  proceed  throughout  the  major  part  of  their 
course  with  a  simple  and  constant  order,  the  exponents  being  constant 
integers,  and  the  specific  reaction  rate  k  being  also  a  constant.  The 
exponents  are,  however,  not  necessarily  integers  and  expressions  of  more 
complicated  form  than  (539)  are  sometimes  more  convenient.^  In  the 
case  of  a  reaction  of  constant  order,  the  specific  rate  k  will  be  a  constant, 
but  many  reactions  occur  in  which  the  order  varies  as  the  reaction  pro- 
ceeds. In  such  a  case  either  the  value  of  k  or  the  values  both  of  k  and 
of  the  exponents  x,  y,  z,  etc.,  may  be  regarded  as  functions  of  the  con- 
centrations if  we  desire  to  express  the  actual  facts  by  an  equation  of 
the  form  of  (539).  The  specific  rate  k  is  in  general  a  function  of  the 
temperature,  usually  increasing  as  the  latter  increases,  and  indeed  often 
doubling  or  trebling  for  a  10°  rise  in  temperature. 

286.  Relation  between  the  Mechanism  and  Order  of  Chemical 
Reactions. — By  the  mechanism  assigned  to  a  chemical  reaction  we  try 
to  express  the  actual  nature  of  the  individual  chemical  processes  which 
are  occurring  in  the  reaction  mixture.  Thus  we  call  a  reaction  uni- 
molecular  if  we  believe  that  only  a  single  molecule  of  the  reactant  is 
involved  in  the  individual  step,  as  for  example  in  radioactive  decomposi- 
tion, bimolecular  if  two  molecules  enter  into  the  individual   step,  as  in 

'  Thus  the  rate  of  the  reaction  H2  +  Br^  =  2HBr  can  best  be  expressed   by  an 
equation  of  the  form 

_  dC  ^  j^    (H,)  (5r.)i 


dt  l+,'(^> 
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the  combination  of  hydrogen  and  iodine  to   form  hydrogen  iodide,  and 
trimolecular,  quadriniolccular,  etc.,  for  larger  numbers  of  molecules. 

The  determination  of  the  actual  mechanisms  by  which  reactions  do 
take  place  is  of  course  often  a  very  uncertain  and  speculative  business. 
The  mechanism  postulated  must  of  course  be  one  which  will  explain 
the  order  of  the  reaction  as  experimentally  found,  and  any  other  facts 
that  we  can  command,  but  more  than  one  hypothetical  mechanism  may 
often  explain  all  the  facts  at  our  disposal  and  the  present  status  of  our 
experimental  knowledge  in  the  field  of  reaction  rates  is  all  too  meagre. 

287.  In  the  case  of  reactions  of  simple  and  constant  order,  the  obvious 
hypothesis  is  to  assume  a  direct  correspondence  between  order  and 
mechanism.  Thus  a  reaction  of  the  first  order  is  presumably  unimolecular 
in  character,  since  such  a  mechanism  would  evidently  lead  to  a  rate  of 
decomposition  proportional  to  the  number  of  molecules  present.  Similarly 
a  reaction  of  the  second  order  with  respect  to  any  component  presumably 
results  from  a  bimolecular  process,  since  the  number  of  collisions  between 
the  molecules  of  this  component  will  evidently  be  proportional  to  the 
square  of  the -concentration.  And  in  similar  fashion  we  can  proceed  to 
reactions  of  higher  orders. 

288.  This  assumed  correspondence  between  order  and  mechanism, 
which  finds  its  rational  explanation  on  the  basis  of  the  kinetic  theory 
of  molecular  motions,  was  of  course  the  original  hypothesis  of  Guldberg 
and  Waage,  in  the  formulation  of  the  law  of  mass  action.  Since  their 
time  several  circumstances  have  tended  to  obscure  the  clarity  and  funda- 
mentality  of  the  idea.  In  the  first  place,  it  is  often  found  that  the 
order  and  mechanism  of  a  reaction  do  not  correspond  to  the  usual 
stoichiometric  equation  which  is  most  naturally  written  to  express  the 
total  chemical  change.  In  the  second  place,  further  investigation  has 
often  shown  that  the  rate  of  a  reaction  originally  supposed  to  be  of  a 
simple  definite  order  really  depends  on  the  concentration  of  substances 
which  had  not  previously  been  varied.  And  finally  the  discovery  of  many 
reactions  which  are  catalyzed  by  traces  of  impurity,  such  for  example  as 
water  vapor,  has  unfortunately  led  some  investigators  to  become  fearful 
of  the  whole  basis  of  chemical  kinetics  and  to  suspect  that  catalytic  com- 
plications lurk  behind  every  molecule  like  prowling  wolves.  The  rational 
procedure,  however,  seems  to  be  to  regard  the  order  of  any  simple  re- 
action as  prhna-facie  evidence  of  its  mechanism,  and  not  introduce  com- 
plications until  these  are  made  necessary  by  the  facts.  In  addition,  it 
is  to  be  appreciated  that  even  in  the  case  of  a  complicated  set  of  simul- 
taneous reactions,  the  individual  steps  of  the  total  process  are  them- 
selves simple  reactions  of  definite  order  and  mechanism,  so  that  a  study 
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of   the  theory  of   isolated   reactions   is  the   natural   preliminary    for   the 
study  of  all  branches  of  chemical  kinetics. 

289.  A  couple  of  illustrations  of  the  possibility  of  drawing  pre- 
mature conclusions  as  to  the  order  and  mechanism  of  chemical  reactions 
will  be  of  interest. 

Thus  the  rate  of  transformation  of  a  given  sample  of  aceto-acetic 
ester  from  the  enol  to  the  keto  form  is  found  to  be  of  the  first  order 
with  respect  to  the  ester  present.  This  might  lead  to  the  conclusion  that 
the  transformation  was  a  simple  unimolecular  change.  It  has  been  found, 
however,  that  the  change  is  catalyzed  by  minute  traces  of  hydroxide  present 
and  will  not  proceed  at  all  if  kept  in  quartz  vessels  and  hence  free  from 
alkali.  Hence  we  must  conclude  that  some  intermediate  interaction  with 
hydroxide  is  one  of  the  steps  in  the  total  process. 

As  another  interesting  example,  the  decomposition  of  ozone  is  found 
to  be  of  the  second  order  with  respect  to  the  concentration  of  ozone.  It 
would  be  incorrect,  however,  to  conclude  that  the  reaction  is  the  bimolecu- 
lar  combination  of  two  molecules  of  ozone  to  form  three  of  oxygen,  since 
further  investigation  shows  that  the  reaction  is  inhibited  by  the  presence 
of  oxygen,  and  that  the  more  complete  formulation  of  the  rate  can  be 
expressed  by  the  equation 


dt  (O2) 

This  result  would  be  in  accord  with  the  mechanism  proposed  by  Jahn 

O3  +  O  >  20, 

where  the   first  reaction   is   assumed    fast   enough  to   maintain   the   con- 
centration of  the  monatomic  oxygen  at  its  equilibrium  value 

while  the  overall  rate  of  the  decomposition  is  determined  by  the  second 
reaction  in  accordance  with  the  expression 

_^)  =  .'(03)(0)  =  ..i^  =  .i^= 

It  is  of  course  to  be  recognized  that  this  result  does  not  prove  Jahn's 
mechanism,  but  shows  it  to  be  a  possible  one.  It  may  be  again  emphasized, 
however,  that  complications  such  as  the  above  do  not  destroy  the  funda- 
mental importance  of  the  relation  between  order  and  mechanism  postu- 
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lated  by  Guldberg  and  Waage,  since  our  attempt  will  always  be  to  ana- 
lyze the  total  set  of  reactions  into  separate  steps  of  simple  order  and 
mechanism.  Both  of  the  examples  illustrate  the  desirability  of  deter- 
mining the  effect  of  the  concentration  of  all  materials  present  on  the 
rate  of  a  reaction, 

290.  Unimolecular  Reactions  and  the  Fraction  Decomposed. — In 

the  case  of  unimolecular  reactions,  of  which  the  decomposition  of  gaseous 
nitrogen  pentoxide  is  one  of  the  best  examples,  it  is  found  at  any  given 
temperature  that  a  constant  definite  fraction  of  the  material  present  decom- 
poses in  each  second. 

For  unimolecular  chemical  processes  it  is  observed  in  general  that 
the  fraction  undergoing  change  per  second  increases  rapidly  with  the 
temperature.  From  this  it  seems  necessary  to  conclude  that  the  normal 
state  of  the  molecule  is  a  stable  one,  but  that  certain  higher  quantum 
states  of  the  molecule  are  unstable.  Hence  with  rising  temperature  we 
have  an  increased  rate  of  reaction,  since  the  tendency  to  go  over  into 
higher  states  increases  with  the  temperature  in  a  manner  which  can  be 
determined  from  statistical  mechanical  considerations.  This  matter  of  the 
temperature  coefficient  of  reaction  rates  will  form  the  special  subject  of 
the  next  chapter. 

In  the  case  of  radioactive  decompositions,  the  fraction  decomposing 
per  second  is  found  over  the  wide  temperature  ranges  that  have  been 
tested  to  be  constant.  The  most  natural  inference  would  be  to  conclude 
that  the  internal  motions  of  electrons  or  protons  in  the  atom  occasionally 
result  in  an  unstable  configuration  which  thus  leads  to  a  rate  of  decom- 
position in  accordance  with  the  laws  of  chance.  There  is  also  the  pos- 
sibility, which  has  been  advanced  by  Perrin,  that  the  decomposition  is 
due  to  the  occasional  absorption  of  very  penetrating  radiation  of  extremely 
short  wave  length  of  extraterrestrial  origin.  If  this  should  be  the  case, 
however,  the  radiation  would  have  to  be  enormously  more  penetrating 
even  than  the  penetrating  radiation  recently  studied  by  Kohlhorster  and 
by  Millikan  in  order  to  account  for  the  constancy  in  the  rate  of  de- 
composition which  has  been  found. 

291.  Bimolecular  Reactions  and  Number  of  Collisions. — In  the 

case  of  bimolecular  reactions,  we  naturally  inquire  into  the  relation  be- 
tween the  number  of  molecules  which  collide  and  the  number  of  times 
that  reaction  results.  The  exact  specification  of  what  we  shall  mean  by  a 
collision  in  this  connection  is  of  course  difficult.  Nevertheless,  experi- 
ment and  theoiy  alike  agree  in  confirming  the  supposition  that  the  forces 
around  neutral  atoms  and  molecules  fall  off  with  a  very  high  power  of 
the  distance.     Hence  as  a  first  approximation  it  will   not   be  unreason- 
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able,  just  as  in  the  kinetic  theory  of  gases,  to  consider  molecules  as 
nearly  rigid  spheres,  and  compare  the  number  of  times  reaction  occurs 
with  the  calculated  number  of  collisions  using  the  kinetic  theory  values 
of  diameters. 

For  the  number  of  collisions  between  molecules  of  the  same  kind 
occurring  per  cubic  centimeter  per  second  we  have 

=  2iVVV^  (540) 

where  A^  is  the  number  per  cubic  centimeter,  o  their  diameter,  and  in 
their  mass.  And  for  the  number  between  molecules  of  different  kinds, 
we  have 

Z  =  2N,N.o,.'  ^2nkT  !!^L±ii  ( 541 ) 

where  N-^  and  A^o  are  respectively  the  numbers  of  the  two  kinds  present 
per  cubic  centimeter,  Oio  the  average  diameter,  and  lUi  and  nu  their  masses. 
(See  Chapter  5.) 

For  purposes  of  calculation  in  connection  with  reaction  rates,  it  is  often 
convenient  to  express  the  above  results  in  terms  of  the  number  of  mols 
per  cubic  centimeter  which  suffer  collision  per  second ;  we  then  obtain  for 
the  two  cases 


4    =    10^8.2922    (^2 

A 


0'  ^5  (542) 


and 


_  =  10^,.„„C.Go./^T-jjr__i  (543) 


where  A  is  Avogadro's  number,  the  concentrations  are  in  mols  per  cubic 
centimeter,  the  diameters  in  centimeters  and  the  masses  expressed  as  ordi- 
nary molecular  weights. 

These  equations  apply  of  course  strictly  only  to  gases.  It  seems, 
nevertheless,  probable  that  we  can  safely  regard  them  as  giving  upper 
limits  to  the  numbers  of  colHsions  between  solute  molecules  in  a  solution. 

292,  Using  these  equations  for  the  number  of  collisions  and  sub- 
stituting values  for  the  molecular  diameters  of  the  order  of  10~^  to  10~'^ 
centimeters,  we  always  find  for  bimolecular  reactions  in  the  usual  case 
where  the  rate  has  a  high  temperature  coefficient  that  the  number  of  col- 
lisions enormously  exceeds  the  number  of  times  a  reaction  occurs.  The 
factor  is  so  great,  for  example  of  the  order  10^^  in  the  case  of  the  bimolecu- 
lar decomposition  of  nitrous  oxide  at  1000°  absolute  studied  by  Hinshel- 
wood  and  Burk,*  that  we  cannot  reasonably  conclude  that  the  failure  to 

*  Hinshelwood  and  Burk,  Proc.  Roy.  Soc,  106A,  284  (1924). 
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react  is  predominantly  due  to  lack  of  proper  orientation  of  the  molecules  at 
the  time  of  collision.  It  seems  more  reasonable  to  assume  either  that  the 
molecules  must  be  in  a  higher  quantum  state  to  react  on  collision,  or 
that  only  collisions  of  high  kinetic  energy  can  lead  to  reaction.  These 
matters  are  of  course  susceptible  to  statistical  mechanical  treatment  and 
will  be  considered  in  the  next  chapter. 

293.  In  the  case  of  bimolecular  reactions,  in  which  the  rate  does  not 
have  a  large  temperature  coefficient,  we  can  conclude  that  activation  to  a 
higher  quantum  state  is  not  necessary  and  hence  shall  expect  a  closer 
correspondence  between  number  of  collisions  and  number  of  times  a  re- 
action occurs.  Experimental  information  concerning  such  reactions  is  at 
present  necessarily  very  scarce,  owing  to  the  probability  of  a  high  rate 
comparable  to  the  number  of  collisions  and  owing  to  the  fact  that  the 
lack  of  appreciable  temperature  coefficient  makes  it  impossible  to  vary 
the  rate  in  the  customary  manner  and  thus  find  a  convenient  range  to 
work  in.  By  indirect  methods,  however,  it  has  been  concluded  by 
Bodenstein  and  Liicktmeyer  ^  that  the  rate  of  recombination  of  monatomic 
bromine  has  a  value  which  is  at  least  nearly  independent  of  the  tempera- 
ture and  of  the  concentration  of  other  gases  present,  and  that  about  one 
collision  out  of  1250  leads  to  recombination.  Such  a  factor  might  easily 
result  from  the  necessity  for  proper  orientations  of  the  atoms  at  the 
time  of  collision. 

294.  It  would  of  course  be  very  surprising  if  we  found  a  case  of 
bimolecular  reaction  in  which  the  rate  of  reaction  was  greater  than  the 
calculated  number  of  collisions.  The  interpretation  of  such  a  circum- 
stance could  only  be  that  the  range  of  action  of  the  "chemical"  forces 
surrounding  molecules  is  much  greater  than  we  had  suspected.  It  seems 
doubtful  if  such  a  state  of  affairs  will  be  found ;  it  is  very  important,  how- 
ever, that  calculations  similar  to  the  above  should  be  made,  whenever 
the  data  present  themselves. 

295.  Reactions  of  Higher  Order  and  Number  of  Multiple  Col- 
lisions.— It  is  a  significant  experimental  fact  that  reactions  in  which  the 
rate  is  of  higher  order  than  the  second  have  so  far  been  but  rarely 
encountered.  Even  in  the  case  where  the  stoichiometric  equation  for  the 
change  would  lead  us  to  expect  a  reaction  of  higher  order,  this  is  sel- 
dom found.  A  reasonable  explanation  of  this  fact  lies  in  the  prob- 
able circumstance  that  triple  or  higher  order  encounters  with  a  proper 
orientation  to  permit  a  reaction  between  the  molecules  will  be  vei^y  rare, 

"Bodenstein  and  Liicktmeyer,  Zeitschr.  fiir  Physik.  Chem.,lH,  233   (1924). 
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and  hence  the  chemical  change  that  does  take  place  will  usually  occur 
as  the  result  of  a  series  of  bimolecular  steps. 

Reactions  of  higher  order  than  the  second  are,  however,  sometimes 
found.  The  polymerization  of  cyanic  acid  to  cyamelide  is  a  third  order 
heterogeneous  gas  reaction,  the  absolute  rate  depending  on  the  surface 
of  the  solid  cyamelide  present  and  being  proportional  to  the  cube  of  the 
gas  pressure.  The  reduction  in  solution  of  ferric  iron  to  ferrous  iron  by 
stannous  chloride  is  a  third  order  reaction  which  apparently  takes  place 
in  accordance  with  the  mechanism 

2Fe+^^  +  Sn^"  -^  2Fe^^  +  Sn"+^+ 

and  the  reaction  in  solution  between  ferrous  iron  and  chlorate  ion  which 
is  catalyzed  by  acid  is  also  of  the  third  order,  being  proportional  to  all 
three  concentrations  of  ferrous  ion,  chlorate  ion,  and  hydrogen  ion. 

As  a  reaction  of  the  fourth  order,  we  have  the  interaction  of  bromide 
and  bromate  which  appears  as  though  dependent  on  the  step 

2W  +  Br-  +  BrOs"  -^  HBrO  +  HBrO, 

and  as  a  reaction  of  the  eighth  order  the  interaction  of  hydrochloric  and 
chloric  acids  which  appears  as  though  dependent  on  the  step 

4H^  +  2CIO3-  +  2C1-  -^  2H0CIO3  +  CL 

Four  very  interesting  gas  reactions  have  also  been  studied  which  are 
of  the  third  order  and  apparently  homogeneous.  These  all  involve  quite 
interestingly  the  same  molecule  NO.  The  stoichiometric  equations  for 
these  reactions  are  ^ 

2NO  +  Oo  =  2NO2 

2NO  -f-  do  =  2NOC1 

2NO  +  Bro  =  2NOBr 

2NO  +  2Ho  =  N2  +  2HoO 

The  first  three  of  these  gas  reactions  are  third  order  in  agreement  with 
the  mechanism  which  would  be  assumed  from  the  stoichiometric  equation. 

The  fourth  reaction  is  found  to  be  second  order  with  respect  to  NO, 
and  first  order  with  respect  to  Yio.  This  experimental  fact  would  be  in 
agreement  with  the  mechanism 

2NO  +  H2  =  N2  +  H0O2     (rate  determining  step) 

H2O2  +  H2  =  2H2O  ( fast  reaction) 

'Bodenstein,  Zeitschr.  f.  physik.  Chcm.,  100,  68  (1922).  Trautz,  Zeitschr.  f. 
anorg.  Chem.,  88,  285  (1914).  Trautz  and  Dalai,  Zeitschr.  f.  anorg.  Ohem.,  102^ 
149  (1918).     Hinshelwood  and  Green,  Jour.  Chem.  Soc,  129,  730  (1926). 
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or  also  with  the  mechanism 

2NO  +  H,  =  N2O  +  HoO    (rate  determining  step) 

N2O  4-  H2  =  N2  +  HoO        (fast  reaction) 

296.  Reactions  such  as  the  above  in  which  the  rate  is  certainly  found 
experimentally  to  be  of  higher  order  than  the  second  do  occur.  The 
interpretation  of  these  higher  orders  as  indicating  the  simultaneous  col- 
lision of  three  or  more  molecules  is,  however,  not  certain,  since  the 
existence  of  a  small  equilibrium  concentration  of  a  compound  formed  from 
the  reacting  substances  is  not  ruled  out,  and  this  compound  could  then 
enter  into  the  reaction  leaving  the  order  unchanged.  Thus  the  reaction 
between  three  substances  A,  B  and  C  to  form  their  final  products  might 
take  place  in  the  following  two  steps 

A-^B  :^  AB 

AB  -\-C  -^  Products 

Since  the  compound  AB  is  assumed  to  be  in  equilibrium  with  A  and  B, 
its  concentration  would  be  maintained  proportional  to  the  product  of  the 
concentrations  A  and  B,  and  the  overall  rate  of  the  reaction  would  be 
that  of  the  second  step.  The  observed  rate  of  the  reaction  would  thus 
be  of  the  third  order  in  spite  of  the  fact  that  it  is  brought  about  by  two 
bimolecular  steps. 

297.  The  question  of  intermediate  compound  formation  versus  col- 
lision between  three  or  more  molecules  is  to  some  extent  amenable  to 
theoretical  treatment.  If  the  molecules  of  substances  were  rigid  elastic 
spheres,  as  we  assume  for  a  first  approximation,  it  is  immediately  evi- 
dent that  the  simultaneous  collision  of  three  or  more  molecules  could 
never  occur,  since  the  time  during  which  two  molecules  remain  in  con- 
tact would  be  zero.  It  is  evident,  however,  that  for  purposes  of  estimat- 
ing the  chance  of  chemical  reaction,  we  are  not  interested  in  a  hypo- 
thetical instantaneous  contact  between  molecules,  but  rather  in  calculating 
the  number  of  times  per  second  that  the  molecules  which  are  to  react 
come  within  a  small  range  of  each  other. 

298.  To  investigate  this  problem,  we  shall  first  need  to  calculate  the 
mean  length  of  time  that  the  surfaces  of  tivo  colliding  molecules  remain 
within  a  short  distance  of  each  other  d.  This  distance  d  may  be  regarded 
as  the  separation  that  the  molecules  can  have  and  chemical  action  still 
be  possible,  and  the  length  of  time  during  which  they  remain  within 
this  distance  may  be  called  the  duration  of  collision.  We  shall  treat  the 
molecules  as  rigid  spheres  and  assume  that  d  is  small  compared  with  the 
diameters  of  these  spheres. 
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If  now  we  consider  a  collision  between  two  of  the  molecules  in  which 
the  component  of  relative  velocity  parallel  to  the  line  of  centers  has  the 
value  R,  it  is  evident  that  before  the  collision  the  two  molecules  will  be 
within  the  range  d  for  a  time  equal  to  d/R.  Furthermore  it  is  evident 
from  the  principle  of  microscopic  reversibility  that  on  the  average  the 
time  before  and  after  collision  during  which  the  molecules  are  within  this 
range  will  have  the  same  values.  Hence  to  determine  the  mean  duration 
of  these  collisions,  we  merely  have  to  determine  the  mean  value  of  2d/R 
for  all  collisions. 

We  have,  however,  in  accordance  with  equation  (123b)  in  Chapter  5 
for  the  fraction  of  all  collisions  in  which  the  component  of  relative 
velocity  parallel  to  the  line  of  centers  lies  in  the  range  R  to  R  -\-  dR,  the 
expression 

dZ  ntim-^       1 )»i»'j    _gl 

—;t-  =  ; TTF   e       nil  +  m^  zkT  KClK 

Z  Ml  -f-  W2   kl 

and  hence  we  can  obtain  the  mean  duration  t   for  all  collisions  by  an 
integration  over  all  values  of  R  in  accordance  with  the  expression 


zo     m,m2       1    ^___-_._^^^  ^244  >^ 


R  Wi  +  ^2  kT 

0 

Simplifying  and  performing  the  integration  we  easily  obtain 


or 


V^rf'T    ^  (545) 

yi  kT (nh. -\- m2) 

(546) 


in  the  case  the  two  molecules  have  the  same  mass. 
The  reciprocals  of  these  quantities 


7=1V 


kT{nh  +  W2) 


27imiW2 
and 

T         d  ^nm 


(547) 


(548) 


will  evidently  be  the  chance  per  unit  time  that  a  pair  of  two  molecules 
within  a  distance  <5  apart  will  break  up.  The  result  contains  a  small 
approximation,  since  it  neglects  the  behavior  of  molecules  which  come 
within  the  distance  d  without  actually  colliding. 
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299.  With  the  help  of  these  expressions  for  the  chance  per  unit  time 
that  a  pair  of  molecules  within  the  reaction  distance  6  will  break  up, 
we  can  now  determine  the  number  of  collisions  suitable  for  a  given 
reaction  which  will  occur  per  second  between  any  desired  number  of 
molecules.  To  do  this  we  first  calculate  by  the  laws  of  chance,  the  num- 
ber of  molecular  constellations  of  the  kind  that  could  lead  to  reaction 
which  are  present  in  the  system  at  any  time.  With  the  help  of  equations 
(547)  and  (548),  we  then  determine  the  chance  per  second  that  some 
one  of  the  necessary  pairs  of  molecules  in  the  constellation  will  break 
up,  and  by  multiplication  with  the  number  of  constellations  present  obtain 
the  rate  per  second  at  which  the  constellations  themselves  would  be  break- 
ing up  if  reaction  did  not  occur,  and  this  can  of  course  be  taken  as  the 
rate  at  which  they  are  forming. 

300.  The  above  method  can  best  be  illustrated  by  examples  since 
the  necessity  of  considering  the  particular  configuration  which  could  be 
expected  to  lead  to  reaction  in  any  given  case  makes  the  development  of 
general  formulae  not  very  practicable. 

As  a  hypothetical  example,  let  us  consider  a  system  containing  A^^, 
N^  and  TVg  molecules  per  cubic  centimeter  of  three  dififerent  kinds  having 
the  masses  7%,  m.,  and  m-^,  and  let  us  assume  that  reaction  could  be  ex- 
pected to  have  a  chance  of  taking  place  when  a  collision  occurs  in  which 
simultaneously  a  molecule  of  the  first  kind  is  within  the  distance  d  of  one 
of  the  second  kind,  and  this  in  turn  within  the  distance  d  of  one  of  the 
third  kind. 

If  now  we  single  out  a  given  molecule  of  the  first  kind,  it  is  evident 
that  the  chance  that  a  molecule  of  the  second  kind  will  lie,  within  the 
distance  d  from  it,  is  equal  to  the  ratio  of  the  volume  47iOi2^d  within  which 
the  center  of  the  second  molecule  must  lie  to  the  total  volume,  substantially 
unity,  in  which  it  may  lie,  multiplied  by  the  number  of  molecules  of  the 
second  kind  N.,.  Hence  for  the  number  of  pairs  present  per  cubic  centi- 
meter at  any  instant  we  should  expect 

N,,  =  N^NA7io,^^d  (549)* 

*  The  above  derivation  of  this  equation  contains  approximations  which  are  only 
valid  at  high  enough  dilutions.  We  may  proceed  more  rigorously  as  follows.  If 
we  single  out  a  specified  pair  of  molecules  of  the  first  and  second  kinds,  the  prob- 
ability that  they  will  lie  within  the  range  8,  all  other  molecules  of  the  first  and 
second  kinds  lying  outside  the  range  6  of  the  specified  molecules  of  the  second  and 
first  kinds  respectively,  will  be 


_  4  Ka\2b    /  1  —  4  K(fr2byNi  -  1)  /  1  —  4  n(f,,b\ 


(iV2—   1) 


The  molecule  of  the  second  kind,  however,  could  now  be  chosen  in  N2  independent 
ways  to  give  us  Ni  mutually  exclusive  configurations,  each  having  the  probability 
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The  chance,  however,  that  a  molecule  of  the  third  kind  will  be  within 
the  distance  of  any  given  molecule  of  the  second  kind  will  be  Ns,^7i02z'd. 
Hence  neglecting  the  steric  hindrance  of  the  molecules,  we  should  expect 
for  the  number  of  favorable  triplets  present  at  any  time  per  cubic  centi- 
meter 

A^,,3  =  NrNoN,  {Ano^o")  (471023')^-  (550) 

Such  a  triplet,  however,  will  be  broken  up  and  reaction  prevented  if 
either  of  the  pairs  12  or  23  breaks  up.  Hence  referring  to  equation 
(547)  for  the  chance  per  unit  time  that  a  pair  will  break  up,  we  can 
write  for  the  chance  per  unit  time  that  a  triplet  will  break  up 

T         0  ^       Z7imim2  o  yl       Zmn^nis 

The  product,  however,  of  the  number  of  triplets  present  by  their 
chance  of  breaking  up  evidently  gives  us  the  number  of  triplets  disin- 
tegrating and  reforming  per  second,  so  that  we  can  write 


Z  = 


for  the  number  of  triple  collisions  per  second  per  cubic  centimeter,  of  the 
particular  kind  which  might  lead  to  reaction.'^ 

To  use  the  above  expression  in  interpreting  a  given  reaction  of  the 
third  order,  we  shall  have  to  compare  the  quantity  Z  with  the  experi- 
mental rate 

^^'  =kN^N.Ns  (553) 


dt 

given  above,  so  that  the  probability  that  a  specified  molecule  of  the  first  kind  will  be 
thus  associated  with  any  one  of  the  second  kind  is 


N'. 


4 Jto'i^S    / 1  —  4 Ka'i,b\ (N^-\)/l  —  4 jta'i26yiV 


This,  however,  is  the  fraction  of  the  time  that  a  given  molecule  of  the  first  kind 
is  thus  associated  and  hence  the  number  of  such  pairs  we  may  expect  present  at 
any  time  is 

,,,,  4xo\.8   /I  — 4jt(r'i28\Wi-i)/l  — 4xa^i28\(A^2-n 

For  dilute  systems,  however,  the  above  expression  reduces  to  equation  549.  Fur- 
thermore, in  such  dilute  systems  the  number  of  pairs  in  which  more  than  one 
molecule  of  one  kind  is  associated  with  a  molecule  of  the  other  kind  can  be  neglected. 

Similar  considerations  apply  to  the  derivation  of  equation  550. 

'  This  treatment  may  be  compared  with  the  calculations  of  Smoluchowski,  Boltz- 
mann  Festschrift,  p.  626,  and  Syrkim,  Physik.  Zeitschr.,  24,  236  (1923),  where  for 
the  most  part,  however,  the  assumption  of  point  particles  has  been  made. 
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It  is  immediately  evident  that  the  reaction  can  be  accounted  on  the  basis 
of  triple  collisions  only  in  case  we  can  satisfy  the  relation 

Z  >  kN.N^Ns  (554) 

If  on  substituting  reasonable  values  of  the  order  10~^  centimeters  for 
Oi2  and  O03  we  do  not  find  it  possible  to  satisfy  this  inequality,  taking  <5 
as  a  very  small  fraction  of  the  molecular  diameter,  we  shall  conclude 
that  the  mechanism  of  the  reaction  cannot  be  one  depending  on  triple 
collisions,  but  must  necessarily  involve  preliminary  association  of  the 
molecules. 

301.  The  application  of  these  methods  to  actual  experimental  cases 
is  at  present  limited  by  the  nature  of  the  data  available.  Most  of  the 
higher  order  reactions,  that  we  know,  occur  in  aqueous  solution  between 
ionized  substances,  and  thus  introduce  difficulties.  The  hydration  of  the 
ions  which  presumably  occurs  may  be  expected  to  make  the  chances 
of  break  up  less  than  we  calculate  from  equations  (547)  and  (548), 
and  to  introduce  unknown  steric  hindrances.  In  addition  the  chance 
that  two  ions  will  be  within  a  distance  d  is  increased  if  the  ions  are  of 
the  opposite  sign  and  decreased  if  they  are  of  the  same  sign,  and  this 
would  have  to  be  allowed  for  by  the  methods  used  by  Milner  and  Debye 
in  developing  the  interionic  attraction  theory. 

As  an  experimental  case  to  use,  merely  by  way  of  illustration,  we  shall 
choose  the  gas  reaction  between  nitric  oxide  and  oxygen 

2NO  +  Oo >  2NO2 

and  shall  try  to  test  the  possibility  that  this  reaction  could  occur  as  the 
result  of  triple  collisions,  in  which  two  NO  molecules  are  simultaneously 
within  some  small  distance  d  from  an  oxygen  molecule. 

In  accordance  with  our  previous  method  of  treatment,  the  number  of 
such  triplets  present  per  cubic  centimeter  at  any  time  would  be 

N=^-^^  {Ano.^dy  (555) 

wr 

where  A'^i  and  N2  are  respectively  the  numbers  of  NO  and  NO2  molecules 

per  cubic  centimeter,  and  O12  is  the  mean  diameter  of  the  two  ions,  and 

the  reason  for  the  factor  1/2  is  evident. 

Such  a  triplet  can  be  broken  up  if  either  NO  molecule  leaves  the 

Oo  molecule  and  hence  in  accordance  with  equation  (547)  we  shall  have 

for  the  chance  of  destruction  per  unit  time 

1    _  2    jkTjnH  +  ^  _  2    fRTjM,  +  M,) 
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Combining  with  equation  (555),  we  get  for  the  number  of  triple  col- 
Hsions  per  second  per  cubic  centimeter  which  could  lead  to  reaction 

Taking  the  mean  diameter  Oio  as  3  X  10"®  centimeters,  the  temperature  as 
273°  absolute  and  substituting  for  AI^  and  J/o  the  molecular  weights  of 
NO  and  O2,  we  obtain 

Z=  1.95  X  10--*  <3.Vi-A'o     colhsions  per  second  (558) 

This  is  to  be  compared  with  the  value  obtained  by  Bodenstein  for  the 
rate  of  the  reaction  at  0°, 

^'^ "  =  2.37  X  10-'«  Ni'N.  (559) 


dt 

Comparing  (558)  and  (559)  we  see  that  the  value  of  <5  could  be  as 
small  as  10"^^  centimeters  and  still  have  the  number  of  collisions  greater 
than  the  rate  of  reaction.  Hence  our  calculation  does  not  rule  out  the 
possibility  of  triple  collisions  as  a  possible  mechanism  for  this  third-order 
reaction,  as  was  also  concluded  by  Bodenstein  (/.  c.)  using  a  different 
method  of  estimating  the  number  of  triple  collisions. 

If  we  assumed  such  a  mechanism,  then  the  small  negative  temperature 
coefficient  actually  found  for  the  rate  of  this  reaction  would  have  to  be 
accounted  for  by  the  hypothesis  that  triple  collisions  of  high  velocity  are 
less  likely  to  lead  to  reaction  than  those  of  low  velocity.  Such  an  assump- 
tion, however,  would  not  seem  out  of  the  question.  (Compare  Chapter 
17,  Section  231.) 

Although,  as  we  have  seen,  our  calculation  has  not  eliminated  the  pos- 
sibility of  triple  collision  as  the  mechanism  for  the  reaction  between  NO 
and  Oo,  nevertheless,  as  is  so  often  the  case  in  chemical  kinetics,  we  can 
not  conclude  that  this  is  the  actual  mechanism  of  the  reaction.  The  reac- 
tion may  be  due  to  intermediate  chemical  steps  of  which  we  now  have  no 
knowledge. 

302.  Heterogeneous  Reactions. — The  foregoing  treatment  of  order 
and  mechanism  applies  to  homogeneous  reactions  and  specially  to  those 
taking  place  in  gaseous  systems.  Some  discussion  must  now  be  given 
of  the  rates  of  heterogeneous  reactions.  This  is  a  field  in  which  experi- 
mental data  and  theoretical  interpretation  are  rapidly  accumulating,  and 
at  the  present  stage  of  development  it  will  be  wise  to  treat  only  a  few 
simple  cases.  We  shall  consider  primarily  reactions  which  occur  at  a 
solid-gas  boundary. 

In  the  case  of  heterogeneous  reactions,  two  factors  are  operative  in 
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determining  the  rate  of  change,  in  the  first  place  the  rate  at  which  the 
reacting  substances  come  to  the  boundary  where  the  reaction  occurs,  and 
in  the  second  place  the  nature  of  this  surface  and  the  mechanism  of 
the  changes  that  it  induces. 

In  the  last  chapter  we  discussed  the  first  of  these  factors  and  obtained 
a  simple  expression  (434)  for  the  rate  at  which  the  molecules  of  a 
gas  at  a  given  temperature  strike  a  surface  of  unit  area.  And  in  that 
chapter  we  applied  this  expression  to  investigate  the  rates  of  simple 
heterogeneous  processes  like  evaporation  and  solution  which  are  usually 
considered  as  physical  rather  than  chemical.  The  equation,  however,  is 
of  course  also  of  value  in  interpreting  the  rates  of  chemical  changes 
that  take  place  at  a  surface. 

As  to  the  second  factor  mentioned  above,  it  has  been  emphasized  by 
Langmuir,  in  agreement  with  our  knowledge  of  crystal  structure  as 
obtained  from  X-ray  analysis,  that  we  must  regard  the  molecules  at  the 
surface  of  a  solid  as  definitely  oriented,  forming  a  checkerboard-like 
arrangement  or  two-dimensional  lattice.  The  exposed  atoms  or  mole- 
cules on  this  surface  are  found  by  Langmuir,  as  perhaps  might  be 
expected,  to  have  unsaturated  fields  of  force  on  the  side  away  from 
the  solid,  and  hence  to  have  a  great  tendency  to  adsorb  gaseous  mole- 
cules, presumably  in  definite  positions  on  the  diflferent  elementary  spaces 
provided  by  the  two  dimensional  lattice,  to  form  a  layer  one  molecule 
deep.  Layers  of  greater  depth  apparently  do  not  form  until  the  gas 
is  brought  to  pressures  and  temperatures  near  the  condensation  point. 
Lender  these  circumstances  it  is  e\-ident  that  reactions  at  the  surface 
^\'ill  be  largely  determined  by  the  behavior  of  these  adsorbed  films  of 
unimolecular  depth.  And  we  shall  confine  our  further  considerations 
principally  to  cases  in  which  such  concepts  apply. 

303.  The  Rates  o£  Adsorption  and  Reevaporation. — We  shall  first 
consider  the  rates  of  adsorption  and  reevaporation  for  two  of  several 
possible  cases  discussed  by  Langmuir.  By  equating  the  rates  of  adsorp- 
tion and  reevaporation  in  these  cases,  we  shall  then  obtain  equations 
connecting  the  amount  of  gas  adsorbed  at  equilibrium  with  the  pressure 
of  the  gas. 

Case  I. — As  the  first  case  let  us  consider  adsorption  on  a  plane  sur- 
face containing  .Vo  elementary  spaces  per  square  centimeter,  each  capable 
of  holding  one  molecule  of  adsorbed  gas. 

For  the  number  of  molecules  striking  this  surface  per  second,  we 
shall  have  in  accordance  with  equation  (437)  of  the  preceding  chapter 


Z  =  ^===  (560) 

%  2:i)nkT 
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And  if  a  is  the  chance  (usually  nearly  unity)  that  a  molecule  will  con- 
dense when  it  strikes  an  uncovered  portion  of  the  surface,  and  -^  is  the 
fraction  of  the  surface  which  is  bare,  we  shall  have  for  the  rate  of 
adsorption 

^■ads.  =  GL^Z  (561) 

On  the  other  hand,  the  rate  of  reevaporation  will  evidently  be  propor- 
tional to  the  number  of  molecules  on  the  surface,  giving  us 

^evap.  =  vNo'&i  (562) 

where  v  is  the  chance  per  second  at  the  given  temperature  that  a  mole- 
cule will  reevaporate,  and  ??i  is  the  fraction  of  all  the  A^'o  spaces  which 
contain  an  adsorbed  molecule. 

We  evidently  have,  however,  for  the  total  surface 

t^-f^i=l  (563) 

and  for  the  number  of  molecules  A^  adsorbed  on  the  surface 

N=zNo^,  (564) 

Equating  (561)  and  (562)  to  obtain  the  condition  for  equilibrium,  and 
substituting  (563)  and  (564),  we  easily  obtain  for  the  number  of  mole- 
cules adsorbed  at  equilibrium 

(—)z 

N  =  N,       ^f'\  (565) 

Or  substituting  o  for  the  ratio  a/vNo  we  can  write 

.  =  .,^  (5«) 

Since  Z  is  related  to  the  pressure  of  the  gas  by  expression  (560),  this 
equation  gives  the  relation  between  the  number  of  molecules  adsorbed 
at  equilibrium  and  the  pressure  of  the  gas  at  a  given  temperature. 

For  the  adsorption  of  a  number  of  gases  on  mica  and  glass  this 
equation  was  found  by  Langmuir  ^  to  express  the  actual  experimental 
facts.  The  experimental  value  of  A^o  in  that  equation  was  for  all  gases 
found  to  be  somewhat  less  than  the  total  number  of  molecules  of  solid 
calculated  to  be  on  the  surface  of  the  glass  or  mica.  This  shows  that  the 
assumption  of  layers  not  greater  than  one  molecule  thick  is  justified  and 
indicates  that  some  of  the  surface  spaces  did  not  have  the  proper  fields  of 
force  to  produce  adsorption. 

'Langmuir,  Jour.  Amcr.  Chem.  Soc,  40,  1361   (1918). 
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The  quantity 

0  =  ~  (567) 

occurring  in  equation  (566)  is  capable  of  physical  interpretation,  a  is 
the  chance  of  condensation  when  a  molecule  hits  a  vacant  space  and  vNq 
is  the  rate  of  evaporation  from  one  square  centimeter  of  occupied  spaces. 
Hence  o  is  a  measure  of  the  binding  force  of  adsorption  and  has  been 
called  by  Langmuir  the  "relative  life"  of  the  molecules  on  the  surface. 
The  relative  life  of  the  molecules  on  glass  and  mica  was  found  to  be 
of  an  order  several  thousand  to  a  million  times  greater  than  the  relative 
life  of  the  same  molecules  on  the  surface  of  condensed  liquid  at  the 
same  temperature.  This  illustrates  the  great  strength  of  the  stray  fields 
of  force  from  the  solid  surface  and  explains  the  lack  of  tendency  to 
form  a  second  layer  on  the  solid  surfaces  except  when  near  the  con- 
densation point. 

Case  II. — As  a  second  case  let  us  consider  the  adsorption  of  a  diatomic 
gas  on  a  surface  such  that  each  atom  is  held  on  a  separate  one  of  the 
elementary  spaces,  and  assume  a  large  value  for  the  heat  of  dissociation 
so  that  the  rate  of  evaporation  of  individual  atoms  will  be  small  com- 
pared with  the  rate  of  evaporation  of  two  atoms  from  adjacent  spaces 
in  the  form  of  a  molecule. 

In  the  state  of  equilibrium  Langmuir  assumes  a  haphazard  distribution 
of  the  atoms  over  the  surface.  If  i?'  is  the  fraction  of  the  surface  that 
is  bare,  then  the  chance  that  any  given  space  is  vacant  is  •&  and  the 
chance  that  two  given  adjacent  spaces  are  vacant  ??",  so  that  the  rate  of 
adsorption  of  molecules,  which  can  only  occur  with  two  adjacent  spaces 
vacant  will  be  proportional  to  d^^.  On  the  other  hand  the  chance  of  re- 
evaporation  will  be  determined  by  the  chance  that  two  adjacent  spaces 
are  filled  and  hence  will  be  proportional  to  '^x^,  where  "^x  is  the  fraction 
filled.  Hence  equating  the  two  rates  to  determine  the  conditions  at 
equilibrium  we  obtain 

a§^Z  =  vJx^  (568) 

where  a  and  v^  are  constants  and  Z  is  the  rate,  given  by  equation  (560), 
at  which  molecules  strike  the  surface. 

For  the  number  of  atoms  adsorbed,  however,  we  can  substitute  No'&i 
and  for  the  number  of  molecules  A^  one  half  this  value,  giving  us 

No^x  .        2N  2N  , 

and  substituting  in  (568)  we  obtain 
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For  a  simple  case  of  practical  interest  where  the  fraction  covered  is  small 
compared  with  unity,  we  obtain 

There  is  evidence  to  show  that  this  type  of  atomic  adsorption  occurs 
for  certain  surfaces  and  gases,  and  we  shall  use  equation  (571)  in  in- 
terpreting heterogeneous  reaction  rates. 

For  w-atomic  molecules  condensing  in  n  separate  elementary  spaces 
we  obtain 

n    \vi  / 

304.  Reaction  between  a  Gas  and  Solid. — The  simplest  imaginable 
case  of  a  heterogeneous  chemical  change  would  perhaps  be  the  reaction 
between  a  solid  and  a  gas,  in  which  a  certain  fraction  of  the  molecules 
coming  up  to  the  surface  combine  with  the  solid,  the  product  formed 
being  immediately  removed  by  evaporation.  The  case  is  quite  nearly 
met  by  the  reaction  studied  by  Langmuir  "  between  a  hot  tungsten  fila- 
ment and  oxygen  to  form  the  yellow  oxide  WO3,  which  distills  ofif  as 
soon  as  formed.  The  rate  of  the  reaction  was  found  at  any  tempera- 
ture to  be  proportional  to  the  pressure  of  oxygen. 

Experiments  on  the  effect  of  oxygen  on  the  electron  emission  of 
the  filament  as  well  as  other  data,  indicated  that  the  filament  in  these 
measurements  was  almost  completely  covered  with  a  unimolecular  oxide 
film.  The  number  of  molecules  of  oxygen  hitting  this  film  per  second 
is  evidently  proportional  to  the  pressure  in  accordance  with  equation  (560), 
and  hence  if  we  assume  at  any  given  filament  temperature  that  a  definite 
fraction  of  the  impinging  molecules  react  together  with  oxygen  present 
in  the  film  to  form  WO3,  we  should  expect  the  rate  to  be  proportional 
to  the  pressure  of  the  oxygen  as  was  found  experimentally  by  Langmuir. 
The  fraction  of  the  impinging  molecules  reacting  was  found  to  increase 
with  the  temperature  of  the  filament  from  0.00033  at  1070°  K  to  0.15 
at  2770°  K.  The  fraction  reacting  was,  however,  unaffected  by  the  tem- 
perature of  the  gas  (i.e.  bulb  temperature),  the  number  reacting  at  a 
given  filament  temperature  being  always  proportional  to  the  number 
colliding  with  the  film  as  given  by  equation  (560). 

305.  Reaction  of  a  Gas  at  a  Solid-Solid  Interface. — In  case  the 
product  formed  by  reaction  between  a  solid  and  gas  is  itself  a  solid  not 
miscible  with  the  first,  we  must  regard  the  system  as  containing  three 
phases,   two   solid  and   one   gaseous.      For   such   a   system   it   can   easily 

"Langmuir,  Jour.  Amer.  Client.  Soc,  37,  1139   (1915). 
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be  shown  theoretically  that  the  reaction  rate  cannot  in  general  be  de- 
termined by  the  area  of  the  exposed  surfaces  of  the  two  solids,  but 
that  the  chemical  change  must  take  place  at  the  solid-solid  interface  be- 
tween them. 

As  a  typical  example  let  us  consider  the  reaction  between  copper 
oxide  and  hydrogen  in  accordance  with  the  equation 

H3  -f  CuO  =  HoO  -f  Cu 

Suppose  now  we  should  assume  that  the  two  opposing  reactions  which 
are  both  taking  place  at  equilibrium  had  rates  proportional  to  the  ex- 
posed surfaces  S^  and  ^'2  of  copper  oxide  and  copper,  and  to  the  impacts 
made  by  hydrogen  and  water  vapor  per  square  centimeter  of  surface,  we 
should  then  have  in  accordance  with  equation  (560)  for  the  number  of 
impacts,  the  condition  for  equilibrium 

p,S^  =  Kp.So  (573) 

where  iv  is  a  constant  and  p^  and  p^  are  the  partial  pressures  of  hydrogen 
and  water  vapor.  We  should  thus  find  that  the  ratio  of  pressures  de- 
pended on  the  ratio  of  the  exposed  surfaces.  We  know,  however,  from 
the  principles  of  chemical  equilibrium  that  the  ratio  of  the  pressures 
is  independent  of  the  amounts  of  the  two  phases  whenever  they  are 
present  in  appreciable  quantities.  Hence  we  conclude  that  at  equilibrium 
the  rates  of  the  two  reactions  can  only  depend  on  the  extent  of  the 
common  interface  between  the  two  solids,  and  that  the  reactions  take 
place  at  this  boundary.  We  are  thus  led  to  the  belief  that,  even  when 
equilibrium  has  not  been  reached,  reactions  in  such  cases  will  take  place 
at  an  interface.  Similar  conclusions  in  the  case  of  other  types  of  re- 
actions involving  two  separate  solid  phases  can  of  course  also  be  drawn. 
These  conclusions  have  been  satisfactorily  confirmed  by  the  measure- 
ments of  Pease  and  Taylor  ^°  on  the  rate  of  the  above  reaction,  by 
Jones  and  Taylor  ^^  on  the  rate  of  reduction  of  copper  oxide  by  carbon 
monoxide  and  by  many  other  examples  of  the  autocatalytic  effect  produced 
by  the  increase  in  interface  area  as  reactions  of  such  kind  proceed. 

306.  Catalytic  Reactions  of  Gases  at  a  Solid  Surface. — The  theory 
of  adsorption  to  form  unimolecular  layers  has  been  used  by  Langmuir 
in  interpreting  rates  of  gas  reactions  which  are  catalyzed  by  a  solid  sur- 
face. In  discussing  such  reactions  it  is  convenient  to  treat  the  two  limit- 
ing cases  in  which  the  surface  is  only  slightly  covered  by  adsorbed  gas, 
and  in  which  the  surface  is  nearly  completely  covered. 

"Pease  and  Taylor,  Journ.  Amer.  Chem.  Soc,  43,  2179  (1921). 
"Jones  and  Taylor,  Journ.  Phys.  Chcm.,  27,  623  (1923). 
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Case  I. — If  we  have  a  case  in  which  the  surface  of  the  catalyst  is 
only  slightly  covered,  we  shall  have  at  any  temperature,  in  accordance 
with  equations  (572)  and  (560),  for  the  number  of  molecules  of  ad- 
sorbed gas  of  any  kind  per  square  centimeter 

where  n  is  the  number  of  elementary  spaces  occupied  by  the  molecule 
in  question,  K  is  a  constant  and  p  is  the  partial  pressure  of  the  gas  in 
question. 

This  equation  can  immediately  be  used  to  interpret  a  number  of 
catalytic  reactions.  Thus  the  dissociation  of  hydrogen  by  a  hot  filament 
may  be  regarded  as  due  to  the  evaporation  of  atoms  which  are  held  on 
separate  elementary  spaces  on  the  filament  surface.  This  will  give  us 
n  =  2,  and  the  rate  is  actually  found  experimentally  to  be  proportional 
to  the  square  root  of  the  pressure.  As  another  example  the  experi- 
ments of  Bodenstein  and  Ohlmer  ^-  on  the  rate  of  reaction  between  car- 
bon monoxide  and  oxygen  on  crystalline  quartz  surfaces  can  be  inter- 
preted if  we  assume  that  the  carbon  monoxide  condenses  on  a  single 
elementary  space  and  the  oxygen  atomically  on  two  elementary  spaces, 
since  the  rate  was  experimentally  found  to  be  proportional  to  the  partial 
pressure  of  the  monoxide  and  the  square  root  of  the  partial  pressure  of  the 
oxygen.  This  is  just  what  we  should  expect  if  the  reaction  occurs  between 
an  adsorbed  carbon  monoxide  molecule  and  a  neighboring  oxygen  atom. 

The  mode  of  application  of  such  considerations  to  other  cases  where 
the  fraction  of  the  surface  covered  is  small  will  be  evident. 

Case  II. — If  we  have  a  case  in  which  the  surface  is  largely  covered, 
the  treatment  of  heterogeneous  catalysis  is  again  easily  handled. 

The  simplest  example  will  be  the  effect  of  an  adsorbed  layer  act- 
ing as  a  catalytic  poison.  Thus  the  strong  adsorption  of  oxygen  on  a 
tungsten  filament,  previously  mentioned,  makes  a  trace  of  oxygen  a 
catalytic  poison  for  the  decompositions  of  hydrogen,  methane  and  am- 
monia by  actually  preventing  these  gases  from  reaching  the  metal  sur- 
face. Only  after  the  oxygen  has  been  used  up  to  form  WO3  as  previously 
discussed  does  the  decomposition  of  the  other  gas  commence. 

As  a  more  complicated  case,  we  may  take  the  reaction  between  carbon 
monoxide  and  oxygen  on  a  platinum  surface.^^  At  low  temperatures,  500° 
to  700°  K,  and  pressures  of  the  order  of  0.1  mm.  or  less,  the  reaction 
rate  is  proportional  to  the  pressure  of  oxygen  and  inversely  proportional 
to  the  pressure  of  carbon  monoxide  and  has  a  high  temperature  coefficient. 

"Bodenstein  and  Ohlmer,  Zeitschr.  fiir  physik.  Chem.,  53,  175   (1805). 
"Langmuir,  Jour.  Amer.  Chem.  Soc,  37,  1139  (1915)  ;  Trans.  Faraday  Soc,  17, 
621   (1921-22). 
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At  higher  temperatures,  750°  to  1050°  K,  the  rate  is  practically  inde- 
pendent of  the  temperature.  With  an  excess  of  oxygen  the  rate  at  these 
higher  temperatures  is  proportional  to  the  pressure  of  the  carbon  monox- 
ide, and  with  an  excess  of  carbon  monoxide  proportional  to  the  pressure 
of  the  oxygen. 

These  facts  can  be  explained  if  we  assume  a  tendency  for  the  forma- 
tion of  unimolecular  layers,  in  the  case  of  oxygen  each  atom  being  held 
by  one  elementally  space  and  in  the  case  of  carbon  monoxide  each  mole- 
cule being  held  by  two  elementary  spaces  corresponding  to  the  two  un- 
saturated valences  of  the  carbon  atom.  We  must  further  assume  that 
the  reaction  occurs  only  when  an  adsorbed  oxygen  atom  is  hit  by  a  car- 
bon monoxide  molecule  from  the  gas,  and  that  at  low  temperatures  the 
surface  is  largely  covered  by  carbon  monoxide  and  at  high  temperatures 
covered  by  oxygen  in  the  presence  of  an  excess  of  that  gas  but  otherwise 
nearly  bare. 

If  ?^  is  the  fraction  of  the  surface  which  is  bare  and  d'^  the  fraction 
covered  by  carbon  monoxide,  we  may  equate  under  equilibrium  condi- 
tions the  rates  of  adsorption  and  evaporation  of  carbon  monoxide  in 
accordance  with  the  equation 

a,d^-Z,  =  v,§^  (575) 

On  the  other  hand  in  the  case  of  oxygen  we  may  equate  the  rate  of 
adsorption  a^&'Zi  to  the  rate  of  reaction  co,  since  we  shall  assume  that 
adsorbed  oxygen  atoms  remain  until  removed  by  reaction  with  carbon 
monoxide,  giving  us 

w  =  a.r~Z.,  (576) 

Furthermore,  we  can  obviously  also  express  the  rate  of  formation  of 
carbon  dioxide  by  the  equation 

2co  =  eZ,§,  (577) 

where  §2  is  the  fraction  of  the  surface  covered  by  oxygen  and  e  the 
chance  of  reaction  by  an  impinging  carbon  monoxide  molecule.  And 
finally  we  can  write  for  the  total  surface 

^-f  ^1  +  1^2  =  1  (578) 

These  four  equations  permit  a  general  solution  for  the  four  variables 
"&,  ■&X,  ^2  and  CO. 

For  the  low  temperature  case,  however,  we  can  assume  the  surface 
nearly  covered  by  the  monoxide,  giving  us  i>i  =r  1,  and  can  combine  equa- 
tions (575)  and  (576)  to  give  at  once 

a.  =  ^  (579) 
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Since  Z^  and  Z^  by  equation  (560)  are  proportional  to  the  pressures  of 
carbon  monoxide  and  oxygen,  we  thus  have  an  immediate  agreement  with 
tlie  experimental  facts.  The  large  temperature  coefficient  of  the  reaction 
rate  at  the  lower  temperatures  is  to  be  explained  by  the  increasing  frac- 
tion of  the  surface  left  free  of  carbon  monoxide  as  the  temperature 
rises. 

At  higher  temperatures,  in  the  presence  of  excess  of  oxygen,  the 
platinum  is  nearly  covered  with  oxygen  so  that  the  rate  is  determined 
by  the  number  of  collisions  of  carbon  monoxide  with  this  adsorbed  film 
and  is  hence  proportional  to  the  pressure  of  carbon  monoxide  as  ex- 
perimentally found.  At  higher  temperatures  in  the  presence  of  excess 
of  carbon  monoxide,  the  surface  is  largely  bare  and  the  rate  is  deter- 
mined by  the  rate  at  which  oxygen  strikes  the  surface,  since  any  oxygen 
that  condenses  remains  until  it  is  removed  by  reaction  with  carbon 
monoxide.  The  rate  is  then  proportional  to  the  pressure  of  oxygen  as 
experimentally  found. 

Almost  the  same  considerations  were  found  to  hold  for  the  reaction 
between  hydrogen  and  oxygen  on  platinum  surfaces,  and  similar  treat- 
ment can  be  successfully  given  to  a  number  of  other  catalytic  reactions. 

307.  Explosive  Reactions. — In  concluding  this  chapter  a  brief  men- 
tion may  be  made  of  the  beautiful  results  obtained  by  F.  W.  Stevens  ^^ 
on  the  velocity  of  explosion  waves  in  mixtures  of  carbon  monoxide  and 
oxygen.  The  experiments  were  performed  by  blowing  soap  bubbles  con- 
taining the  two  gases  in  different  proportions,  touching  off  the  mixture 
electrically  at  the  center,  and  following  the  propagation  of  the  luminous 
reaction  zone  photographically.  The  reaction  thus  takes  place  under  con- 
ditions of  constant  pressure. 

The  remarkable  result  was  found  that  the  speed  of  propagation  of 
this  explosive  wave  was  proportional  to  the  concentration  of  the  oxygen 
and  the  square  of  the  concentration  of  the  carbon  monoxide  in  accord- 
ance with  the  equation 

s  =  k{02){C0y  (580) 

This  result  presumably  bears  some  relation  as  yet  unknown  to  the  order 
which  would  be  assumed  from  the  stoichiometric  equation 

O2  +  2CO  =  2CO2 

The  whole  field  is  a  very  interesting  one  for  further  investigation.  It 
would  be  specially  interesting  to  investigate  the  temperature  coefficient 
of  such  speeds  of  propagation. 

"  F.  W.  Stevens,  Zeitschr.  des  Ver.  deutschen  Ingcnieiire,  No.  20,  p.  659  (1926)  ; 
Jour.  Amcr.  Chem.  Soc,  48,  1896  (1926). 


Chapter  21. 
The  Temperature  Coefficient  of  Reaction  Rate. 

308.  Introduction.  The  Arrhenius  Postulate. — In  the  preceding 
chapter  the  temperature  coefficient  of  reaction  rates  was  only  incidently 
mentioned  and  the  subject  will  be  treated  more  fully  in  the  present 
chapter.    The  discussion  will  apply  mainly  to  homogeneous  gas  reactions. 

As  already  mentioned  reaction  velocities  are  usually  found  to  have  a 
very  considerable  temperature  coefficient,  the  rate  often  doubling  or 
trebling  for  a  ten  degree  rise  in  temperature.  The  fundamentality  of 
this  fact,  however,  must  not  be  unduly  exaggerated,  since  it  applies  of 
course  to  the  data  already  made  available  by  observation,  and  reactions 
in  which  the  rate  cannot  be  controlled  by  temperature  would  be  difficult 
to  investigate  if  the  rate  happened  to  be  very  large  or  very  small.  Thus 
as  noted  in  the  preceding  chapter,  if  the  rate  of  bimolecular  reactions 
without  appreciable  temperature  coefficient  should  be  of  the  order  of  the 
number  of  collisions  occurring,  the  rates  would  be  too  fast  to  measure 
except  by  indirect  methods. 

To  explain  the  high  temperature  coefficient  of  known  reaction  rates, 
it  was  postulated  by  Arrhenius  that  the  molecules  involved  in  the  reaction 
are  unreactive  when  in  their  normal  states,  and  that  reaction  can  only 
occur  between  molecules  in  activated  states.  Since  the  concentration  of 
the  molecules  in  an  activated  state  would  be  proportional  to  their  total 
concentration,  this  postulate  would  leave  the  order  of  the  reaction  the 
same  as  though  molecules  in  their  normal  state  could  react.  Further- 
more, since  the  concentration  of  activated  molecules  would  increase  rapidly 
with  the  temperature,  we  should  thus  be  able  to  account  for  the  high 
temperature  coefficient  of  reaction  rate. 

This  hypothesis  of  Arrhenius  with  its  later  modifications  has  proved 
extraordinarily  suggestive.  The  activated  states,  thus  postulated  by  him 
long  before  the  modern  development  of  the  quantum  theory,  may  now 
be  identified  when  occasion  requires  with  higher  quantum  states  of  the 
molecule.  The  process  of  activation  in  other  cases,  however,  can  con- 
sist in  a  dissociation  or  combination  of  a  kind  more  familiar  in  the  older 
chemistry.  In  addition,  as  we  shall  presently  show,  it  is  possible  to 
calculate  on  the  basis  of  the  hypothesis  the  average  energy  of  the  acti- 

^59 
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vated  molecules  from  a  knowledge  of  the  temperature  coefficient  of  the 
reaction  rate  itself. 

309.  The  chief  obstacle  at  present  encountered  by  the  Arrhenius  pic- 
ture of  the  mechanism  of  chemical  reaction  lies  in  the  difficulty  of  dis- 
covering processes  by  which  sufficient  energy  can  be  transmitted  to  mole- 
cules in  their  normal  states  rapidly  enough  so  that  the  quota  of  activated 
molecules  is  replenished  as  fast  as  is  necessary  to  account  for  observed 
rates  of  reaction.  We  shall  give  special  attention  to  this  feature  of 
the  problem  in  a  later  part  of  the  chapter.  Whether  this  difficulty  means 
that  the  Arrhenius  picture  must  be  abandoned  or  subjected  to  further 
serious  modification  is  not  now  evident.  It  may  be  mentioned  in  advance, 
however,  that  other  quantum  processes  are  certainly  known,  in  which 
energy  is  supplied  at  a  rate  far  too  fast  to  be  easily  explained. 

310.  We  shall  first  investigate  the  relation  between  temperature  co- 
efficient and  energy  of  activation  which  would  be  expected  on  the  basis 
of  the  Arrhenius  hypothesis.  To  do  this  it  will  be  advantageous  to  give 
separate  treatments  to  four  possible  different  cases :  ( 1 )  unimolecular 
reactions  in  which  the  rate  of  activation  is  fast  enough  to  maintain  the 
equilibrium  concentration  of  activated  molecules;  (2)  unimolecular  re- 
actions in  which  the  rate  of  activation  is  not  fast  enough  to  maintain 
the  equilibrium  concentration  of  activated  molecules;  (3)  bimolecular 
reactions  between  previously  activated  molecules;  and  (4)  bimolecular 
reactions  in  which  the  energy  of  activation  is  the  kinetic  energy  of  the 
colliding  molecules  themselves. 

311.  Unimolecular  Reactions  in  Which  the  Activated  Molecules 
Are  Maintained  at  the  Equilibrium  Concentration. — Let  us  consider 
a  first-order  unimolecular  reaction  in  which  the  rate  at  which  molecules 
can  be  activated  is  so  high  compared  with  the  rate  at  which  the  activated 
molecules  decompose  or  otherwise  react  that  the  equilibrium  concentra- 
tion of  molecules  in  the  activated  states  is  always  maintained. 

For  the  number  of  molecules  in  any  activated  state  a,  we  can  then 
write  in  accordance  with  the  Maxwell-Boltzmann  distribution  law 

N.  =  Mh£^  (581) 

i 

where  N  is  the  total  number  of  molecules,  pa  the  a-priori  probability  of 
the  state  a,  Sa  the  energy  of  a  molecule  in  that  state,  and  the  summation 
in  the  denominator  is  to  be  taken  over  all  states. 

If  now  we  assign  to  the  molecules  in  each  activated  state  a  a  chance 
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of  spontaneous  reaction  of  magnitude  ka  per  second,  we  should  obtain 
for  the  total  rate  of  reaction 

6a 

.J  Nl.kapae      kT 

—  EL  =  -^ (582) 

dt  V.   -  — 

Z^piC      kT 

* 

where  the  summation  in  the  numerator  is  to  be  taken  over  all  the  states  a 
which  contribute  appreciably  to  the  total  rate  of  change. 

It  is  evident  that  the  postulated  mechanism  leads  to  a  first-order  re- 
action as  assumed,  the  specific  rate  of  reaction  being 

6a 

^kapae      'kf 

k,  =  ^ -jr  (583) 

^pie    kT 

i 

An  actual  knowledge  of  the  values  of  ka  for  the  different  activated  states 
would  now  permit  a  theoretical  prediction  of  the  actual  specific  rate. 
This  is  of  course  for  the  present  beyond  our  powers.  We  can,  however, 
obtain  information  as  to  the  temperature  coefficient  of  the  specific  re- 
action rate  by  differentiating  with  respect  to  T.  It  will  be  most  advan- 
tageous to  take  a  logarithmic  differentiation,  giving  us 


d  log  ki        1  dki 

^    ^kapae      ''"t-^2 

dT      ~  k^dT  ~ 

k                             ^* 

fct 


s^.  ^T-^ 


kT  Hpie    kT 


(584) 


The  first  term  on  the  right-hand  side  of  equation  (584)  is  evidently 
the  sum  of  the  values  of  Sa/kT^  for  the  molecules  that  react  divided  by 
the  number  so  reacting  and  hence  gives  the  average  energy  of  the  mole- 
cules that  enter  into  reaction.  The  second  term  on  the  other  hand  gives 
the  average  energy  of  all  the  molecules,  so  that  we  may  rewrite  equation 
(584)  in  the  simple  form 


d  log  ki  £  —  £ 

dT      ~    kT^ 


(585) 


where  £  is  the  average  energy  of  the  molecules  that  react  and  £  is  the 
average  energy  of  all  the  molecules. 

312.  In  the  cases  of  usual  interest,  the  average  energy  of  all  the 
molecules  and  the  average  energy  of  the  imactwated  molecules  will  not 
be  appreciably  different,  since  very  few  molecules  will  be  in  the  activated 
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states  because  of  their  high  energy  content.  Hence  the  difference  (e  —  e) 
will  be  the  average  energy  that  has  to  be  supplied  to  the  molecules  that 
react  and  may  be  called  the  energy  of  activation.  We  thus  see  that  for 
this  case  the  temperature  coefficient  of  reaction  rate  immediately  gives 
the  energy  of  activation. 

313.  Unimolecular  Reactions  in  Which  the  Rate  of  Activation 
Is  Not  Fast  Enough  to  Maintain  the  Equilibrium  Concentration  of 
Activated  Molecules. — We  must  next  consider  first-order  unimolecular 
reactions  in  which  the  rate  of  reaction  is  so  fast  compared  with  the 
rate  at  which  molecules  can  be  activated  that  the  equilibrium  concen- 
tration of  activated  molecules  is  not  maintained.  As  a  limiting  case  this 
includes  the  merging  of  the  processes  of  activation  and  reaction,  every 
molecule  that  is  activated  entering  into  the  reaction. 

314.  In  order  to  carry  out  our  treatment  we  shall  first  need  to  obtain 
an  important  relation  betzvcen  the  rates  of  activation  and  deactivation  of 
molecules,  which  can  be  derived  with  the  help  of  the  principle  of  micro- 
scopic reversibility. 

Consider  a  system  in  equilibrium,  and  let  Ni  and  A'^a  be  the  number  of 
molecules  per  cubic  centimeter  in  two  states  i  and  a.  For  convenience 
we  may  call  i  the  initial  state  and  a  an  activated  state.  Now  we  may 
obviously  put  the  rate  at  which  molecules  are  passing  from  state  i  to 
state  a  proportional  to  the  number  in  state  i  in  accordance  with  the 
equation 

Zia  =  ^iaNi  (586) 

where  we  may  call  /5ia  the  coefficient  of  activation. 

Consider  now  the  reverse  processes  by  which  molecules  pass  from 
state  — a  to  — i.  Evidently  the  rate  of  such  reverse  processes  will  be 
proportional  to  the  number  of  molecules  N.a  in  the  state  —  a,  so  that  we 
can  write  for  the  number  of  molecules  passing  per  second  from  state 
—  a  to  — i 

Z.a-i  =  a.a-i  N_a  (587) 

where  a.a-i  may  be  called  a  coeificient  of  deactivation. 

In  accordance  with  the  principle  of  microscopic  reversibility,  how- 
ever, these  two  rates  must  be  equal  if  the  system  is  at  equilibrium  so 
that  we  can  write 

^,aNi  =  a.a.iN.a  (588) 

The  two  concentrations,  however,  are  connected  in  accordance  with 
the  Maxwell-Boltzmann  distribution  law  by  the  equation 

^^tle~Tf  (589) 

Ni         pi 
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where  p-a  and  pi  are  the  a-priori   probabihties  of  the   two   states   and 
£{.0  is  the  difference  in  energy  of  the  two  states. 
Substituting  in  equation  (588)   we  obtain 

.        ^ia  =  a.a.i^e-Tf  (590) 

pi 

as  our  relation  between  the  coefficients  of  activation  from  states  i  to  a, 
and  deactivation  from  states  — a  to  — i. 

The  a-priori  probabiHty  p.a  is,  however,  evidently  equal  to  the  a-priori 
probability  pa  of  its  reverse  state  and  the  energy  difference  £i-a  between 
states  i  and  — a  is  evidently  equal  to  the  energy  difference  e,a  between 
states  i  and  a.  This  permits  us  to  rewrite  equation  (590)  in  the  more 
convenient  form 

^^,=  a.a-i^e-if  (591) 

pi 

This  important  equation  makes  no  assumption  as  to  the  mechanism 
of  the  activational  and  deactivational  processes,  and  the  coefficients  ^la 
and  a.a-i  may  be  dependent  on  the  total  concentration  of  molecules 
present  if  the  mechanism  is  collisional,  or  on  the  concentration  of  radiant 
energy  of  different  frequencies  if  this  is  involved,  or  on  whatever  factors 
may  be  operative.  The  equation  may  be  regarded  as  a  generalization  of 
the  relation  between  probabilities  of  absorption  and  emission  discussed 
in  Chapter  16,  and  the  relation  between  probabilities  of  collisions  of  the 
first  and  second  kind  discussed  in  Chapter  17. 

It  must  be  noted  that  equation  (591)  has  been  rigorously  derived 
only  for  a  system  in  statistical  equilibrium.  We  should  usually  expect, 
however,  that  the  specific  rates  at  which  molecules  pass  from'  one  state 
to  another  will  not  be  greatly  affected  by  a  deviation  of  the  system  from 
equilibrium.  In  what  follows  we  shall  use  equation  (591)  as  valid  at 
any  stage  of  the  reaction.  This  is  the  less  questionable  since  it  only 
necessitates  the  constancy  of  the  ratio  /?io  to  a.a-i,  rather  than  the  in- 
dividual constancy  of  the  two  terms. 

315.  With  the  help  of  this  equation  we  are  now  ready  to  treat  the 
more  general  case  of  first-order  unimolecular  reactions  where  the  rate 
of  activation  is  not  necessarily  fast  enough  to  maintain  the  equilibrium 
concentration  of  activated  molecules. 

For  the  rate  at  which  molecules  are  being  activated  we  may  write 

^^  =  22^iaiV,  (592) 
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where  the  summation  is  to  be  taken  over  all  inactive  states  i  and  activated 
states  a. 

A  certain  fraction  of  the  molecules  which  arrive  in  any  activated 
state  a  may  react  and  the  remainder  return  into  unactivated  states.  For 
the  fraction  ■d'a  that  react  we  may  write 

2 
^a=  ^     ^    ?% (593) 

2j  aaf  -\-2.  Oai 
f  i 

where  "^aaf  is  the  total  specific  rate  at  which  molecules  pass  from  the 

active  state  fonmrd  to  a  final  unactivated  state  of  the  product,  and 
"Eaai  is  the  total  specific  rate  of  deactivation  back  to  any  initial  state  i. 

In  case  the  unimolecular  reaction  is  an  isomeric  change  the  final  states  / 
will  be  those  of  a  single  molecule,  but  in  the  case  of  dissociation  / 
stands  for  a  simultaneous  condition  of  more  than  one  molecule.  The 
formulation  neglects  the  passage  of  a  molecule  from  one  activated  state 
to  another  activated  state,  but  this  is  not  serious. 

Combining  equation  (592)  for  the  rate  of  activation  with  the  frac- 
tion of  activated  molecules  which  react,  we  may  now  write  for  the  rate 
of  the  unimolecular  reaction 

-^  =  22^./?,„iV,  (594) 

316.  This  equation  contains  no  hypothesis  except  our  general  pic- 
ture of  the  mechanism  of  unimolecular  reaction,  and  if  we  knew  the 
value  of  the  factors  we  could  use  it  to  calculate  the  actual  rate.  This 
is  of  course  not  now  possible  but  we  can  use  the  equation  to  obtain 
information  as  to  the  temperature  coefficient  of  the  reaction  rate. 

For  ^ia  we  can  substitute  the  expression  derived  above,  given  by 
equation  (591),  and,  assuming  statistical  equilibrium  for  the  unactivated 
molecules  as  seems  justifiable,  we  can  substitute  for  Ni  the  usual  Maxwell- 
Boltzmann  expression.     We  obtain 

...       22  ^a  a.a.i^  e-Ff  NpiC-^ 
^dN^^ Pa _ (595) 

"^^  ^PiC-kf 

i 

where  A^  is  the  total  number  of  molecules  in  unactivated  states,  which 
is  sensibly  equal  to  the  total  number  of  molecules  that  have  not  reacted. 
Noting  that  we  have  the  obvious  relation  eia  ^=  Sa  —  si  we  can  rewrite 
this  in  the  form 
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eo_ 

22  ^a  a.a-ipa  e      kT 

~^  =  — :zK — ^  ^^^^^ 

2  pie    kT 

i 

and  obtain  for  the  specific  reaction  rate 

22  -^a  a-a-ipa  e~"kf 

h  =  ^ -j^-  (597) 

2  pie    kT 

i 

Taking  as  in  the  previous  treatment  a  logarithmic  differentiation  of 
the  specific  reaction  rate  with  temperature,  we  can  now  write 


^^  „  -^/dhg^a    ,    dloga.a-i    .     Sg    \ 
^   1  22  #.  «.„■.^  .    .7-  ^— —  +         ^^       +  -^j 

dT  kx  dT        kx 


d  log  ki        1   dki 


2  pie    kT' 


^Jie    kT^ 
2  pie    kT 


(598) 


It  is  evident   from  the  principle  of  averaging,   however,   applied   in 
the  same  manner  as  in  Section  311,  that  this  can  be  rewritten  in  the  form 


dlogkx  __  dlog'&a       dloga.g.i  Eg  Sj  r^QQ^l 

dT      ~     dT      '^        dT      '^  kT'       kT'  ^       ^ 

where  the  double  line  indicates  an  average  for  the  molecules  that  actually 
enter  into  reaction,  and  the  single  line  indicates  the  average  for  the  un- 
activated  molecules. 

317.  In  using  this  expression  to  interpret  actual  temperature  co- 
efficients, it  should  be  noted  that  the  first  two  terms  on  the  right-hand 
side  of  this  equation  are  presumably  small.  Referring  to  equation  (593) 
for  the  value  of  '&a  we  see  that  both  these  two  first  terms  depend  merely 
on  the  temperature  coefficients  of  the  rates  at  which  molecules  fall  from 
higher  to  lower  quantum  states.  Such  deactivations  can  occur,  however, 
only  by  collisions  of  the  second  kind  with  other  molecules  or  by  the 
emission  of  radiation. 

The  first  of  these  processes  cannot  have  a  large  temperature  co- 
efficient, since  the  number  of  collisions  is  proportional  to  T^'^  and  hence 
at  the  temperatures  of  interest  does  not  change  rapidly.  This  conclusion 
might  have  to  be  modified,  however,  if  the  quantum  state  of  the  deactivat- 
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ing  molecule  should  be  of  major  importance  in  determining  the  probability 
of  a  deactivating  collision. 

The  second  process  can  only  take  place  by  spontaneous  and  induced 
emission  and  can  also  have  only  a  small  temperature  coefficient.  To 
show  this  we  note  in  accordance  with  Chapter  16  that  we  have  for  the 
combined  rate  of  emission  by  the  two  processes 

a,,  =  Bai^^\  '^-kl I  (600) 


V  cTr-l) 


and  by  simplification  and  differentiation  obtain 

d  log  aai hv  1 


dT  kT^    JiL       , 

ekT —  1 


(601) 


hv 

Developing  e  Jf  into  a  series,  however,   we  see  that  the  value   of   this 
can  never  be  greater  than  l/T. 

Hence  in  the  cases  of  special  interest  where  the  temperature  co- 
efficient of  the  reaction  rate  is  large,  we  may  neglect  the  first  two  terms 
in  (599)  and  again  obtain  the  simple  equation  for  temperature  coefficient 
found  in  Section  311 

rfiogfe,  _£--j 

dT      ~   kT^  ^     ^^ 

where  e  —  e  is  the  energy  of  activation  or  average  energy  that  has  to  be 
supplied  to  the  molecules  that  react. 

318.  Bimolecular  Reactions  between  Previously  Activated  Mole- 
cules.'— It  is  even  simpler  to  obtain  the  temperature  coefficient  for  reaction 
rate  in  the  case  of  second  order  bimolecular  reactions. 

If  the  reaction  results  from  the  collision  of  molecules  which  are  already 
activated  in  higher  quantum  states,  it  is  evident  that  the  reaction  can 
remain  second  order  over  a  range  of  concentrations,  only  if  the  frac- 
tions of  the  two  kinds  of  molecules  in  their  activated  states  remain  con- 
stant. Towards  the  end  of  a  second  order  reaction,  however,  the  frac- 
tion of  either  component  removed  per  second  by  reaction  becomes  very 
small  so  that  we  are  forced  to  the  conclusion,  in  the  case  of  such  a 
mechanism,  that  the  process  of  activation  is  fast  enough  at  all  stages  of 
reaction  to  maintain  the  equilibrium  concentration  of  activated  mole- 
cules. Hence  if  A'^  and  N'  are  the  numbers  of  molecules  of  the  two 
kinds  per  cubic  centimeter  we  can  write  for  the  concentrations  of  activated 
molecules  in  any  activated  states  a  and  a',  the  Maxwell-Boltzmann 
expressions 
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Npae~kf            .       , .  ,       N'pa'e~W 
Na  = —      and       N/  =  — ^  (603) 

^pie~kf  '2pi'e~W 

i  i 

Furthermore,  since  the  number  of  collisions  between  molecules  is 
proportional  to  the  square  root  of  the  absolute  temperature  T,  we  may 
evidently  write  for  the  rate  of  the  bimolecular  reaction 

-^  =  V/^^^Ka'NaNa'  (604) 

dt  a  a' 

where  kaa  is  a  constant  for  each  pair  of  activated  states  a  and  a\  depend- 
ing on  the  factors  determining  collision  frequency  given  in  equation  (541) 
of  the  last  chapter,  and  on  the  chance  of  reaction  when  molecules  in 
these  particular  activated  states  collide.  Substituting  expressions  (603) 
and  solving  for  the  specific  reaction  rate  k2,  we  obtain 

'  NN'  dt        '  -  ^  ^  ^"^  ^       ^ 


h.pie-lk\{Y^p,'e-l^i\ 


319.  Here  again  we  do  not  have  the  necessary  information  to  use 
this  equation  for  the  direct  prediction  of  specific  reaction  velocities,  but 
by  differentiation  can  obtain  information  as  to  the  temperature  coefficient. 
Carrying  out  a  logarithmic  dififerentiation  with  respect  to  the  tempera- 
ture T ,  we  obtain 

,,         ,  ,,         ^,,  ,     T/2^^kaa'papa'e-  ^^  ^'  ^  f ' 

d  log  ko  _  a  log  T/2        1         „  „-       '   '^  kT^ 


dT  dT 


+ 


(^Pie-Pr\(^pi'e-Ir) 


— '- ^  (606) 


llpie    kT  Hpi'e   kT 

%  i 

This,  however,  can  evidently  be  rewritten  in  the  form 


d  log  fer  _  J_    ,    gg  +  gg'  —  gj  +  gj'  (607^ 

dT      ~  2T~^  kT^  ^       ^ 

where  the  double  line  indicates  an  average  taken  for  the  molecules  that 
actually  react  and  the  single  line  an  average  taken  for  all  molecules. 
Here  again,  since  the  average  energy  of  all  the  molecules  and  the  average 
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energy  of  the  molecules  in  unactivated  states  is  approximately  the  same, 


the  quantity  Ca  -|-  Sa  —  ei  -{-  si'  will  be  the  average  energy  that  has  to  be 
supplied  to  the  molecules  that  react,  so  that  we  again  have  a  simple  relation 
between  temperature  coefficient  of  reaction  rate  and  energy  of  activation. 

320.  Bimolecular  Reaction  in  Which  the  Molecules  are  Acti- 
vated by  Collision.— Another  possibility  for  bimolecular  reaction  would 
be  that  normal  molecules  will  react  provided  they  collide  with  sufficient 
relative  velocity.  Such  a  mechanism  could  be  regarded  as  merging  the 
processes  of  activation  and  reaction. 

Assuming  this  mechanism  we  cannot  of  course  specify  the  exact 
nature  of  a  collision  that  would  lead  to  reaction ;  it  would  seem  reason- 
able, however,  to  make  it  dependent  on  the  relative  velocity  of  the  mole- 
cules parallel  to  their  line  of  centers  at  the  time  of  impact.  We  have, 
however,  in  accordance  with  equation  (123b)  in  Chapter  5  for  the  num- 
ber of  collisions  in  which  the  component  of  relative  velocity  parallel  to 
the  line  of  centers  lies  in  the  range  R  to  R  -\-  dR 

m-inio  1  minis        R- 

c[Z  =  Z        ',-     -^  e~  '■^m^  -2kT  RdR  (608) 

nix-]-  nvn  kl 

where  Z  is  the  total  number  of  collisions,  and  hence  on  the  basis  of  the 
assumed  mechanism  could  write  for  the  rate  of  reaction 

00 

_dN__  m^m-i  f  ^ "'^""'    _^ 


dt  kT{nix  -f-  m2) 


UrC       mi  +  m^2lT    RdR  (609) 


0 


where  A-r    is  the  chance  of  reaction  if  the  component  of  relative  velocity 
has  the  value  R. 

321.  Without  an  actual  knowledge  of  Ur  this  equation  cannot  of 
course  be  used  to  predict  actual  reaction  rates.  We  can,  however,  obtain 
information  from  it,  as  to  the  temperature  coefficient  which  would  be 
expected  on  the  basis  of  the  assumed  mechanism.  Since  the  total  num- 
ber of  collisions  Z  is  proportional  to  the  product  A^^A^g^'''"  we  can  obtain 
from  (609)   for  the  specific  reaction  rate 

00 

kRe~  m,  +  m,  2kT    RdR  (610) 

0 

where  A  is  a  constant. 

Taking  a  logarithmic  differentiation  with  respect  to  temperature  we 
obtain 
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»  log  ^2  _       J_   I  -/Q V  w^i  +  ^«2  2^7^  / 

rfr     -~2r"*"  ^^ (611) 

The  second  term,  however,  is  evidently  the  average  of  the  quantity  in 
parentheses  for  the  molecules  that  react,  so  that  we  can  rewrite  (611)  in 
the  form 


miWio     R^ 

-7i kl 


d  log  ^2  _    ,       1        ■     ■?»!  +  ^2     2 

dT      ~^  2T  ~^  kT-  ^       ■^ 

It  can  easily  he  shown,  however,  from  the  laws  of  the  conservation  of 

,                         ,           nunio      R~  .      ,  .  ,  .      . 

energy  and  momentum  that  — •  -^y-  is  the  maxnnum  kmetic  energy 

available  for  transformation  into  other  forms  by  a  head-on  collision  of 
two  particles  with  the  relative  velocity  R.  Furthermore,  it  can  also 
easily  be  shown  that  kT  is  the  average  value  of  the  above  quantity  for 
all  collisions.  So  that  the  difference  can  be  regarded  as  the  average 
energy  of  the  collisions  that  lead  to  reaction  minus  the  average  energy 
of  all  collisions  and  this  can  again  be  called  the  energy  of  activation,  so 
that  we  have  obtained  a  relation  between  temperature  coefficient  and  energy 
of  activation  entirely  similar  to  (607)  of  the  preceding  section. 

322,  Summary  as  to  the  Relation  betw^een  Temperature  Coeffi- 
cient and  Energy  of  Activation. — Examining  the  results  of  the  pre- 
ceding sections,  we  see  that  for  all  the  mechanisms  considered  we  can 
write  for  the  temperature  coefficient  of  the  specific  reaction  rate  of  first- 
order  unimolecular  reactions 

d  log  k^       e  —  e  . 

and  for  the  specific  reaction  rate  of  second-order  bimolecular  reactions 

c^  log  ^2  _    1     ,    £  —  e  .^..x 

dT  2T~^    kT^  ^     ^ 

where  the  energy  of  activation  s  —  e  is  at  least  approximately  the  differ- 
ence between  the  average  energy  of  the  molecules  that  react  and  the 
average  energy  of  all  the  molecules. 

If  we  neglect  the  small  change  in  the  energy  of  activation  with  the 
temperature,  the  above  equations  can  be  integrated  to  give  the  following 
expressions  for  the  rates  of  the  two  kinds  of  reaction 
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dN       ,        _^— ^ 
—  —=koe      -TT  N  (615) 

or 

dC  —!1± 

— -71=^0  e      RT  C  (616) 

where  ko  is  the  constant  of  integration  and  E^  the  energy  of  activation 
per  mol,  and 

-^  =  ^0^/2  e-'-^  NN'  (617) 

or 

dC  £^+  E'A 

_^  =  VT'/2  ^ ^7^  CC  (618) 

ar 

where  ko  and  ^o'  are  constants,  and  Ea  +  -Ea'  is  the  sum  of  the 
energies  of  activation  per  mol  brought  into  the  reaction  by  the  two  sub- 
stances. The  method  of  using  equations  (617)  and  (618)  for  the  case 
that  the  two  molecules  are  alike  will  be  evident. 

323.  From  the  known  values  of  the  temperature  coefficients  of  re- 
action rate,  it  is  now  possible  with  the  help  of  the  above  equations  to  cal- 
culate the  actual  values  of  the  energies  of  activation  if  the  Arrhenius 
hypothesis  is  correct.  For  typical  reactions  in  which  the  rate  doubles 
or  trebles  per  ten  degree  rise  at  ordinary  temperatures,  this  leads  to 
energies  of  activation  of  the  order  which  are  known  from  spectral  data 
to  correspond  to  the  oscillations  of  atoms  inside  a  molecule.  This  seems 
entirely  reasonable,  since  the  weakening  of  a  chemical  bond  between 
atoms  might  very  probably  be  due  to  such  oscillations.  The  result  thus 
gives  some  small  evidence  in  support  of  the  Arrhenius  picture. 

324.  The  Mechanism  of  Activation. — If  we  are  willing  to  con- 
tinue on  the  basis  of  a  tentative  acceptance  of  the  Arrhenius  hypothesis, 
we  must  next  inquire  into  the  mechanism  of  the  processes  by  which  the 
molecules  can  be  activated  and  thus  prepared  to  enter  into  the  reaction. 
Since  the  activated  molecules  of  high  energy  content  are  continually  being 
removed  by  the  reaction,  we  have  as  a  minimum  requirement  the  neces- 
sity of  activating  the  molecules  as  fast  as  they  react. 

In  addition  the  order  of  the  reaction  may  impose  further  require- 
ments on  the  rate  of  activation.  Thus  if  the  reaction  is  first-order  and 
unimolecular,  the  rate  of  activation  must  itself  either  be  a  first-order 
process,  or  if  not  first  order,  fast  enough  to  maintain  the  equilibrium 
concentration  of  molecules  in  the  activated  state.  If  the  reaction  is  sec- 
ond  order  and   bimolecular,   taking  place   between   previously   activated 
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molecules,  as  we  saw  in  Section  318,  the  rate  of  activation  must  be  fast 
enough  to  maintain  the  equilibrium  concentration  of  the  activated 
molecules. 

We  shall  now  consider  the  possibilities  of  activation  by  ordinary 
collision, — by  collisions  of  the  second  kind, — by  the  absorption  of  radia- 
tion of  a  single  frequency, — and  by  the  absorption  of  radiation  of  more 
than  one  frequency. 

325.  Activation  by  Ordinary  Collision. — The  activation  of  a 
molecule  by  collision  with  another  molecule  of  high  enough  velocity  is 
one  of  the  most  obvious  hypotheses  to  propose. 

For  a  head-on  collision  between  two  particles,  it  can  easily  be  shown 
from  the  principles  of  conservation  of  energy  and  momentum  that  the 
maximum  kinetic  energy  which  can  be  transformed  into  other  forms  is 

m-i^fh.     R^ 
Wi  -|-  *^2  2 

where  m-^  and  nio  are  the  masses  of  the  two  particles  and  R  is  their 
relative  velocity.  Hence  for  a  collision  between  molecules  it  would  seem 
fairly  reasonable  to  take  this  same  expression  for  the  energy  available, 
provided  we  take  R  as  the  component  of  the  relative  velocity  of  the  two 
molecules  parallel  to  their  line  of  centers  at  the  instant  of  contact.  Hence 
if  £  is  the  energy  we  must  give  a  molecule  in  order  to  activate  it,  we 
shall  take  activation  by  collision  as  being  possible  only  when 

ni^nu     R^ 


nil  -j"  ^^^2   2 


>  £  (619) 


In  accordance  with  equation  (123a)  of  Chapter  5,  however,  the 
fraction  of  all  collisions  Z  in  which  the  relative  component  of  velocity 

has  a  value  greater  than  a  given  value  Rq  is  e  ~i'h  +  m.,  2kTso  that  we  may 
set  as  a  reasonable  upper  limit  for  the  rate  of  activation  by  collision 

rlAF  ''act. 

^^=Z^-T7-  (620) 

at 

where  Z  is  the  total  number  of  collisions  of  the  molecules  which  are  to 
be  activated  and  £act.  is  the  energy  on  the  average  that  has  to  be  supplied 
to  them. 

We  must  now  inquire  by  an  appeal  to  experimental  facts  whether 
such  a  rate  of  activation  would  be  fast  enough  to  account  for  known 
rates  of  reaction. 

326.  In  the  case  of  unimolecular  first-order  reactions  it  seems  certain 
that  the  rate  would  not  be  fast  enough.     The  clearest  cut  example  of  a 
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first-order  homogeneous  gas  reaction  is  the  decomposition  of  nitrogen 
pentoxide.  Comparing  the  rate  of  decomposition  of  this  gas  as  deter- 
mined by  Daniels  and  Johnston  at  25°  with  an  initial  concentration  of 
1.5  X  10"^  mols  per  cubic  centimeter,  with  the  rate  of  activation  cal- 
culated from  equation  (620)  substituting  10~^  as  the  diameter  of  the 
molecules,  the  decomposition  is  found  as  a  matter  of  fact  to  take  place 
approximately  10,000  times  as  rapidly  as  the  calculated  activation.^ 
Similar  results  would  be  found  for  other  first-order  reactions  with  large 
temperature  coefficients,  and  hence  we  cannot  assume  activation  by  ordi- 
nary collision  as  the  mechanism  for  such  reactions. 

327.  In  the  case  of  bimolecular  second-order  reactions  the  condition 
of  affairs  is  quite  different  and  it  seems  possible  that  activation  by  ordi- 
nary collision  might  account  for  known  reaction  rates.  A  very  good 
example  of  a  homogeneous  gas  reaction  of  this  kind  is  given  by  the 
decomposition  of  nitrous  oxide  studied  by  Hinshelwood  and  Burk. 

If  we  assume,  in  accordance  with  Section  318,  reaction  between 
previously  activated  molecules,  and  assume  that  each  molecule  brings  half 
the  necessary  energy  of  activation,  then  using  3.32  X  10"^  cm.  as  the 
diameter  of  a  molecule  it  is  found  by  calculation  from  equation  (620) 
that  the  number  of  molecules  activated  per  second  would  be  20,000  times 
as  great  as  the  number  actually  decomposing  at  the  most  unfavorable 
temperature.^  On  the  other  hand,  if  we  should  assume  that  one  of  the 
two  reacting  molecules  had  to  bring  all  the  energy  of  activation,  the  rate 
of  activation  by  collision  could  not  possibly  be  great  enough. 

Turning  now  to  the  possibility,  discussed  in  Section  320,  that  the 
reaction  occurs  between  nitrous  oxide  molecules  which  collide  with  sufiti- 
cient  relative  velocity  we  can  immediately  apply  the  methods  of  that 
section.  The  actual  rate  of  the  reaction  was  found  to  be  given  by  an 
expression  which  can  be  put  in  the  form  ^ 

dC  58450 

—  ^  =  7.60  X  10^^  r/2  e  --RT-  C 
dt 

On  the  other  hand,  by  taking  the  diameter  of  the  nitrous  oxide  mole- 
cule as  having  the  rather  large  value  1.1  X  10"'^  centimeters,  we  obtain 
from  equation  (123a)  in  Chapter  5  half  this  figure  for  the  number  of 
mols  of  nitrous  oxide  colliding  per  cubic  centimeter  per  second  with  an 
available  energy  greater  than  58,450  calories  per  mol.  Since  the  reaction 
would  decompose  two  molecules  of  oxide  per  collision,  it  also  seems  pos- 
sible to  account  for  the  reaction  on  this  basis. 

*  See  Tolman,  Jour.  Amer.  Chem.  Soc,  47,  1533  (1925). 
^Ibid.,  1535. 
*Ibid.,  1536. 


326-329]  TEMPERATURE  COEFFICIENT  273 

328.  We  thus  find  that  the  bimolecular  decomposition  of  nitrous  oxide 
can  be  accounted  for  on  the  basis  of  activation  by  ordinaiy  collisions. 
This  also  seems  to  be  true  for  other  bimolecular  gas  reactions.  Indeed 
there  seems  to  be  some  theoretical  justification  for  expecting  this  always 
to  be  true. 

To  show  this  let  us  consider  a  simple  case  of  bimolecular  reaction 
between  molecules  of  the  same  kind,  and  assume  reaction  for  every  col- 
lision where  each  of  the  molecules  is  in  the  activated  state  a.  The  rate 
of  the  reaction  will  then  be 

dN  lNpae~jr\  /^91^ 

X^pie      kT  / 

i 

where  Zq  is  the  number  of  collisions  at  unit  concentration.  On  the 
other  hand,  for  the  rate  of  activation  we  have  in  accordance  with  equa- 
tion (620) 

^^=Zo^-/7TiV2  (622) 

Since  the  ratio  of  (622)  over  (621),  however,  contains  the  large  quantity 

e  kf  we  may  expect  in  general  that  the  rate  of  activation  is  large  com- 
pared with  the  rate  of  reaction,  if  each  molecule  brings  half  the  total 
energy  of  activation. 

329.  Activation  by  Collisions  of  the  Second  Kind. — Since  ordi- 
nary collisions  will  not  account  for  the  mechanism  of  activation  for  uni- 
molecular  reactions,  we  must  investigate  other  possibilities. 

It  has  been  suggested  by  Christiansen  and  Kramers  *  that  the  mech- 
anism of  activation  consists  in  collisions  of  the  second  kind,  between 
activated  molecules  of  the  product  formed  in  the  reaction  and  unactivated 
molecules  of  the  original  reactant,  thus  raising  these  latter  to  an  activated 
state  and  preparing  them  to  enter  the  reaction. 

This  theory  can  best  be  understood  with  the  help  of  the  following 
schematic  representation  of  the  course  of  a  unimolecular  chemical  change 
in  which  a  molecule  of  reactant  R  breaks  down  into  products  P  and  Q. 

R:^R'  ->|^,  ~^  ^  (Decomposition)  (A) 

P'  -\-R  -^  P  -\-R'       (Collision  of  second  kind)  (B) 

In  accordance  with  expression  (A)  the  reactant  can  exist  in  two  forms, 
the  normal  form  R  and  an  activated  form  R'.     The  activated  form  breaks 

*  Christiansen  and  Kramers,  Zeitschr.  filr  physik.  Chem.,  104,  451  (1923). 
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down  spontaneously  into  activated  forms  of  the  products  F'  and  Q'  and 
these  then  change  into  the  normal  forms  P  and  Q.  This  latter  change, 
however,  can  take  place,  at  least  in  the  case  of  one  of  the  products, 
through  a  collision  of  the  second  kind  with  a  molecule  of  the  reactant  R, 
thus  leading  to  the  activation  of  a  new  molecule  through  the  mechanism 
indicated  by  expression  (B). 

In  case  the  reaction  deviates  from  a  steady  first-order  course  the 
number  of  such  activations  need  not  be  the  same  as  the  number  of  mole- 
cules that  decompose,  but  in  case  the  reaction  actually  takes  a  first-order 
course,  we  must  introduce  the  special  assumption  that  each  decomposition 
in  accordance  with  (A)  is  always  immediately  followed  by  a  successful 
activation  in  accordance  with  (B).  When  this  latter  assumption  is  true, 
it  is  evident  that  the  full  quota  of  activated  molecules  R^  allowed  by  the 
Maxwell-Boltzmann  law  will  be  maintained,  since  the  loss  of  a  molecule 
by  decomposition  is  immediately  made  good  by  the  specially  assumed 
mechanism,  and,  hence,  the  ordinary  agencies  responsible  for  the  transi- 
tions R^  R'  are  able  to  maintain  the  same  statistical  equilibrium  that 
would  exist  in  the  absence  of  reaction. 

Since  this  hypothesis  would  appear  to  provide  us  with  a  mechanism 
of  activation  which  would  always  maintain  the  full  Maxwell-Boltzmann 
quota  of  activated  molecules,  we  could  immediately  apply  the  methods 
of  Section  311,  and  would  have  no  difficulty  in  accounting  for  the 
temperature  coefficient  of  unimolecular  reactions.  The  method  of  applica- 
tion to  bimolecular  reactions  is  also  obvious. 

330.  Although  this  simple  and  ingenious  assumption  of  activation  by 
collisions  of  the  second  kind  has  much  to  recommend  it,  there  are  un- 
doubted difficulties  which  would  have  to  be  overcome  before  it  could  be 
regarded  as  the  probable  mechanism  of  activation  in  typical  thermal 
reactions.^ 

In  the  first  place  in  order  to  account  on  this  basis  for  a  unimolecular 
reaction  that  retains  its  first-order  character  throughout  the  reaction,  we 
should  have  to  assume  a  very  high  efficiency  in  the  process  of  reactiva- 
tion by  collisions  of  the  second  kind  as  represented  by  expression  (B). 
This  means  of  course  that  activated  molecules  R'  could  only  give  up 
their  energy  to  molecules  of  reactant  R.  Now  as  a  matter  of  fact  col- 
lisions of  the  second  kind  have  been  found  to  be  pretty  specific  in  their 
nature,  as  shown  by  the  work  of  Stuart  ^  on  the  quenching  of  the  fluores- 
cence radiation  from  mercury  in  the  2p2  state  by  the  addition  of  a  variety 
of  gases.     Nevertheless,  it  seems  unreasonable  to  assume  that  molecules 

°  In  cases  of  photochemical  catalysis,  collisions  of  the  second  kind  are  of  prime 
importance. 

"Stuart,  Zt.  filr  Pliys.,  32,  262  (1925). 
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of  the  activated  product  P'  could  make  a  large  number  of  collisions  with 
a  variety  of  molecules  of  different  kinds  without  losing  their  energy, 
and  thus  being  incapacitated  for  the  activation  of  the  original  reactant  R. 

Now  as  a  matter  of  fact,  the  decomposition  of  pure  nitrogen  pentoxide,'^ 
nitrogen  pentoxide  mixed  with  its  own  decomposition  products,  nitrogen 
pentoxide  mixed  with  1000  times  as  many  molecules  of  oxygen,^  and 
nitrogen  pentoxide  mixed  with  76,000  times  as  many  molecules  of  nitro- 
gen ^  all  take  place  at  the  same  rate.  In  addition  the  decomposition  of 
nitrogen  pentoxide  dissolved  in  carbon  tetrachloride  and  in  chloroform  ^° 
take  place  at  nearly  the  same  rate.  These  facts  make  it  very  difficult 
to  accept  the  proposal  that  collisions  of  the  second  kind  account  for  the 
mechanism  of  activation  in  the  decomposition  of  nitrogen  pentoxide. 

A  second  difficulty  that  would  have  to  be  met  by  the  hypothesis  of 
activation  by  collisions  of  the  second  kind  arises  in  the  case  of  endo- 
thermic  reactions,  since  in  such  reactions  a  molecule  of  product  in 
dropping  to  its  normal  state  would  not  give  up  enough  energy  to  raise  a 
molecule  of  the  reactant  from  its  normal  to  its  activated  state.  Hence, 
in  the  case  of  endothermic  reactions,  collisions  of  the  second  kind  would 
presumably  have  to  be  supplemented  by  an  additional  activational  step. 
If  this  supplementary  step  should  occur  before  the  collision  of  the  sec- 
ond kind,  then  the  concentration  of  molecules  of  reactant  in  a  state 
suitable  to  make  successful  collisions  with  the  activated  product  would 
be  much  lower  than  the  total  concentration  of  reactant,  and  the  number 
of  collisions  during  which  the  activated  product  would  have  to  withhold 
its  energy  would  be  greatly  increased.  On  the  other  hand,  if  the  col- 
lisions of  the  second  kind  should  be  followed  by  a  supplementary  activa- 
tion, it  is  evident  that  this  step  would  have  to  be  one  of  very  high 
specific  velocity,  since  the  concentration  of  molecules  in  the  intermediate 
state  would  be  much  lower  than  the  total  concentration  of  reactant.  In 
either  case  it  is  evident  that  activation  by  collisions  of  the  second  kind 
gives  no  easy  explanation  of  the  mechanism  of  endothermic  reactions. 

In  this  connection  it  is  important  to  note  that  the  decomposition  of 
nitrogen  pentoxide  is  endothermic  to  the  extent  of  12,290  calories  when 
it  goes  as  it  does  at  the  beginning  of  reaction  to  nitrogen  dioxide,  and 
apparently  still  endothermic  to  the  extent  of  2,690  calories  even  when 
it  goes  to  the  tetroxide.^^  The  thermochemistry  of  the  actual  individual 
steps  of  the  reaction  is  of  course  unknown,  since  the  steps  themselves  are 

'Daniels  and  Johnston,  Jotir.  Amer.  Chem.  Soc,  43,  53  (1921). 

"White  and  Tolman,  Ibid,  47,  1240  (1925). 

"Hunt  and  Daniels,  Ibid,  47,  1602  (1925). 

"Lueck,  Ibid,  44,  757  (1922). 

"  Calculated  from  thermochemical  values  of  Thomsen,  "Thermochemistry," 
Longmans,  Green  and  Co.,  1920,  p.  218,  and  Berthelot,  Ann.  Chim.  phys.,  6,  170, 
171  (1875). 
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unknown.  The  matter  does  not  look  hopeful,  however,  for  the  proposed 
mechanism  of  activation. 

We  must  conclude  that  activation  by  collisions  of  the  second  kind  does 
not  now  seem  possible  as  a  general  mechanism  of  activation  for  thermal 
reactions. 

331.  Activation  by  the  Absorption  of  Radiation  of  a  Single 
Frequency. — The  very  simple  postulate  has  been  made  by  Perrin  and 
W.  C.  McC.  Lewis  that  the  process  of  activating  a  molecule  consists 
in  the  absorption  of  a  quantum  of  energy  from  the  store  of  thermal 
radiation  which  surrounds  the  reacting  molecules  in  accordance  with  the 
expression 

hv-\-R^R'  (623) 

In  the  original  formulation  of  the  theory  only  the  absorption  of  a 
single  frequency  was  considered,  corresponding  to  the  existence  of  one 
normal  and  one  activated  state,  so  that  the  energy  of  activation  would 
be  directly  connected  with  the  frequency  of  the  radiation  by  the  quantum 
relation 

fiact.  =  hv  (624) 

Assuming  the  radiation  distributed  in  accordance  with  the  Planck 
law,  we  can  write,  for  the  radiation  density  of  frequency  v  the  expression 

u,  =  -—-    ^_i  (625) 

and  hence  if  Bna  is  the  value  of  Einstein's  coefficient  of  absorption  for 
the  passage  from  the  normal  state  n  to  the  activated  state  a,  we  can  write 
for  the  rate  of  activation  on  the  basis  of  this  hypothesis 

e  kT  —  1 

IlL 
Noting  that  e  kT  is  usually  large  compared  with  unity,  and  substituting 

equation  (624)  this  can  be  rewritten  in  the  form 

—J^  =  Bna—^e     kT   N  (627) 

In  the  case  of  first-order,  unimolecular  reactions  it  is  to  be  assumed 
that  every  molecule  or  some  definite  fraction  of  the  molecules  would  react 
as  soon  as  activated  and  hence  the  rate  of  reaction  be  some  definite 
fraction  of  the  value  given  by  equation  (627).  Hence  the  result  agrees 
in  form  and  content  with  our  earlier  equation    (615)    for  the  rate  of 
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first-order  reactions.  In  the  case  of  second-order,  bimolecular  reactions, 
it  is  assumed  that  the  above  mechanism  of  activation  is  fast  enough  to 
maintain  the  full  quota  of  activated  molecules  permitted  by  the  Maxwell- 
Boltzmann  distribution  law,  the  rate  of  reaction  then  depending  on  the 
number  of  coHisions  between  activated  molecules  as  treated  in  Section  318. 
Although  there  are  a  number  of  considerations  which  show,  as  we 
shall  see,  that  the  radiation  theory  in  this  simple  form  could  not  give  a 
valid  account  of  the  mechanism  of  all  reactions,  nevertheless  we  cannot 
but  admire  the  boldness  and  beauty  of  this  conception  which  would  unify 
our  pictures  of  thermal  and  photochemical  reaction,  and  account  in  the 
simplest  possible  manner  for  the  orders  of  reaction  by  providing  a 
method  of  activation  which  is  independent  of  concentration. 

332.  We  must  first  inquire  whether  the  rate  of  activation  given  by 
equation  (627)  could  be  fast  enough  to  account  for  known  rates  of  re- 
action. To  make  this  calculation,  we  must  have  a  value  for  the  coefficient 
of  absorption  Bna,  and  for  this  may  make  use  of  the  discussion  given 
in  Section  221,  Chapter  16,  which  indicates  that  values  of  Bna  greater 
than  10-°  in  c.g.s.  units  are  not  likely  to  be  encountered.  Hence,  since 
everything  else  in  the  equation  will  be  known  in  the  case  of  any  actual 
reaction  whose  temperature  coefficient  has  been  determined,  we  are  at 
once  in  a  position  to  test  the  question. 

Again  considering  the  decomposition  of  nitrogen  pentoxide  as  a 
typical  unimolecular  reaction,  we  find,  however,  that  the  rate  of  activa- 
tion calculated  from  equation  (627)  is  smaller  by  a  factor  of  10*^  than  the 
actual  rate  of  decomposition  experimentally  found. ^^  Similar  relations 
will  hold  for  any  unimolecular  reaction  whose  rate  has  a  high  tempera- 
ture coefficient. 

On  the  other  hand,  in  the  case  of  bimolecular  reactions,  equation  (627) 
might  lead  to  fast  enough  rates  of  activation  provided  we  assume  that 
half  the  energy  of  activation  is  brought  into  the  reaction  by  each  of 
the  two  reacting  molecules.  Thus  for  the  decomposition  of  nitrous 
oxide  the  calculated  rate  of  activation  is  800  times  the  rate  of  decomposi- 
tion at  the  most  unfavorable  temperature.^^ 

333.  In  connection  with  such  calculations  as  to  the  rate  of  activa- 
tion by  the  absorption  of  radiation,  it  should  of  course  be  noted  that  in 
obtaining  equation  (627)  we  have  assumed  the  radiation  density  dis- 
tributed in  accordance  with  the  Planck  radiation  law.  The  process  of 
activation,  however,  would  draw  on  the  supply  of  radiant  energy  of  the 
particular  frequency  required  and  tend  to  lower  the  radiation  density  at 

"See  Tolman,  Jour.  Amer.  Chcm.  Soc,  47,  1543  (1925). 
^Ibid,^.  1544. 
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that  point  below  the  equilibrium  value  prescribed  by  the  Planck  law. 
Furthermore  the  occurrence  of  the  reaction  would  presumably  lead  to 
the  reemission  of  radiation  of  some  other  frequency,  higher  or  lower  than 
the  original,  depending  on  whether  the  total  reaction  was  exothermic  or 
endothermic.  How  rapidly  radiation  equilibrium  would  reestablish  itself 
under  these  circumstances  is  difficult  to  determine.  Of  course  the  walls 
of  the  container  would  play  their  part  in  maintaining  the  equilibrium, 
but  in  addition  all  the  radiation  in  the  enclosure  through  interaction  with 
the  gas  molecules  present  would  strive  to  readjust  itself  to  equilibrium, 
and  it  is  not  now  evident  how  fast  this  would  take  place. 

334.  In  addition  to  the  failure  of  the  simple  radiation  theory  to  pro- 
vide a  fast  enough  rate  of  activation  to  account  for  the  rate* of  uni- 
molecular  reactions  of  high  temperature  coefficient,  the  radiation  theory 
is  subject  to  a  very  special  test,  since  with  the  help  of  equation  (624) 
we  can  calculate  from  the  empirically  determined  energy  of  activation 
the  frequency  of  light  which  should  be  photochemically  active  in  bringing 
about  reaction.  Thus,  in  the  case  of  the  decomposition  of  nitrogen 
pentoxide,  the  energy  of  activation  is  found  to  be  24,700  calories  per 
mol  and  this  corresponds  to  a  wave-length  of  1.61  //.  It  has  been  definitely 
shown  by  Daniels  and  Johnston,^*  however,  that  nitrogen  pentoxide  is 
not  decomposed  by  light  of  this  wave-length.  Furthermore  nitrogen 
pentoxide  has  no  known  absorption  line  or  band  at  this  wave-length. 

335.  Hence  as  the  conclusion  of  this  section  it  may  be  definitely  stated 
that  the  simple  radiation  theory  which  assumed  activation  by  the  passage 
from  a  single  normal  to  a  single  activated  state  through  the  absorption 
of  a  quantum  of  radiation  cannot  give  a  satisfactory  account  of  uni- 
molecular  reactions,  both  because  the  rate  of  activation  would  be  too 
small  and  because  in  at  least  one  case  the  substance  does  not  respond 
to  radiation  of  the  predicted  wave-length. 

336.  Elaborated  Radiation  Theory  of  Activation. — The  simplicity 
and  apparent  promise  of  the  original  radiation  theory  of  chemical  activa- 
tion has  suggested  the  hope  that  an  elaborated  form  of  the  theory  might 
be  able  to  account  for  the  facts.  The  natural  modifications  to  make 
would  be  of  course  to  assume  instead  of  a  single  normal  state  and  single 
activated  state,  a  number  of  chemically  active  or  inactive  quantum  states 
thus  leading  to  the  possibility  of  activation  by  the  absorption  of  other 
frequencies  than  those  given  by  equation  (624),  which  makes  the  quantum 
of  absorbed  radiation  exactly  equal  to  the  energy  of  activation. 

There  are  two  reasons  why  such  a  modification  would  in  any  case 

"Daniels  and  Johnston,  Jour.  Amer.  Chem.  Soc,  43,  72  (1921). 
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seem  inevitable.  In  the  first  place  the  experimental  investigation  of 
the  quantum  properties  of  molecules  has  in  every  case  shown  the  exist- 
ence of  many  quantum  states,  and  in  the  second  place  the  facts  of  photo- 
chemistry show  that  chemical  reactions  are  in  general  not  induced  by  a 
single  frequency,  but  usually  by  a  whole  range  of  frequencies. 

337.  Such  a  modification  of  the  radiation  theory  of  reaction  would 
of  course  immediately  remove  the  difficulty  which  arises  because  the  fre- 
quency of  radiation,  calculated  from  the  energy  of  activation  by  equa- 
tion (624),  is  not  found  experimentally  to  be  absorbed  nor  to  lead  to 
reaction.  On  the  new  basis  the  energy  of  activation  will  not  be  changed, 
but  this  energy  need  not  be  acquired  by  a  single  jump  from  a  single 
normal  to  a  single  activated  state. 

338.  It  is  more  difficult  to  decide  whether  a  modification  of  the 
theory  along  the  above  lines  would  lead  to  rapid  enough  rates  of  activa- 
tion to  account  for  known  reaction  rates.  To  investigate  this  point  it 
should  be  noted  that  the  new  theory  would  lead  both  to  the  assumption 
of  chemically  inactive  states  of  energy  intermediate  between  that  of 
the  lower  normal  states  and  that  necessary  for  complete  chemical  activa- 
tion, and  to  the  assumption  of  more  than  a  single  higher  quantum  state 
which  would  be  chemically  active.  It  will  be  simplest  to  give  separate 
consideration  to  the  effect  produced  by  the  introduction  of  these  two 
different  kinds  of  states. 

339.  With  regard  to  the  intermediate  states  of  less  energy  than  that 
necessary  for  full  chemical  activation  it  seems  evident  that  the  intro- 
duction of  such  states  would  not  lead  to  any  marked  increase  in  the  rate 
of  activation,  such  as  would  be  necessary  if  we  are  to  account  for  known 
rates  of  unimolecular  reaction.  This  arises  from  the  fact  that  the  greater 
density  of  radiation  which  would  be  effective  in  causing  transitions  from 
an  intermediate  state  to  the  activated  state,  compared  with  that  effective 
in  the  full  transition  from  the  normal  state,  would  be  compensated  by  a 
smaller  number  of  molecules  in  the  intermediate  state,  compared  with 
the  number  in  the  normal  state.  Thus,  if  £„  is  the  total  energy  of  activa- 
tion and  Ei  the  energy  necessary  to  raise  molecules  to  the  intermediate 
state,  the  density  of  radiation  effective  in  raising  molecules  from  the 
intermediate  to  a  completely  activated  state  would,  in  accordance  with 
the  Planck  radiation  law,  be  greater  than  that  corresponding  to  the  whole 

jump  by  a  factor  of  approximately  e  kT ;  on  the  other  hand,  m  accordance 
with  the  Maxwell-Boltzmann  distribution  law,  the  number  of  molecules 
in  the  intermediate  state  would  be  less  than  that  in  the  normal  state  by 

the  reciprocal  factor  ^"~/>r,     The  situation  may  be  summed  up  by  the 
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statement  that  the  assumption  of  intermediate  states  between  the  normal 
and  chemically  activated  states  is  almost  certainly  correct,  and  leads  to 
an  explanation  of  the  failure  of  reactions  to  respond  to  calculated  fre- 
quencies. Nevertheless,  it  does  not  seem  probable  that  a  mere  assump- 
tion of  such  intermediate  states  would  alone  lead  to  the  great  increase 
in  speed  of  activation  necessary  to  rescue  the  radiation  theory  of  activation. 

340.  Turning  now,  however,  to  the  assumption  of  more  than  a  single 
state  which  is  chemically  active,  it  is  evident  that  the  condition  of  afifairs 
is  different,  since  a  jump  to  any  one  of  these  states  might  lead  to  re- 
action. Nevertheless,  it  is  also  evident  from  the  discussion  in  Section  332, 
that  we  should  need  to  postulate  a  great  many  such  activated  states  in 
order  to  account  for  known  rates  of  reaction,  since  in  the  case  of  uni- 
molecular  reactions  of  normal  temperature  coefficient  the  assumption  of  a 
single  state  gives  us  a  rate  of  activation  which  is  too  small  by  a  factor 
of  the  order  of  10^.  Hence  it  would  seem  evident  that  the  chemically 
active  states  would  need  to  lie  close  together  in  energy  content  in  order 
to  secure  sufficient  states  within  the  range  where  the  Planck  radiation 
law  would  permit  enough  radiation  density  to  permit  of  frequent 
transitions. 

There  are,  however,  two  reasons  for  believing  that  such  a  continu- 
ous range  of  quantum  states  with  gradually  increasing  energy  contents 
might  exist. 

341.  In  the  first  place  the  facts  of  photochemistry  show,  as  already 
mentioned,  that  photochemical  reaction  is  by  no  means  always  induced  by 
the  absorption  of  a  single  frequency,  but  often  by  the  absorption  of  a 
continuous  range  of  frequencies.  This  corresponds  no  doubt  to  the  exist- 
ence of  a  continuous  range  of  chemically  active  states. 

342.  In  the  second  place  it  seems  reasonable  to  assume  that  the 
process  of  chemical  activation  consists  in  increasing  the  energy  of  vibra- 
tion of  atoms  within  the  molecule.  This  would  correspond  with  the  chem- 
ist's rough  idea  of  "loosening  a  chemical  bond"  and  would  presumably 
make  the  atoms  more  free  to  enter  into  new  chemical  combinations. 
Furthermore,  the  magnitudes  of  known  energies  of  activation  correspond 
to  frequencies  in  the  near  infra-red  which  we  know  to  be  absorbed,  when 
changes  in  atomic  oscillation  take  place.  Hence,  we  might  feel  justified 
in  making  the  hypothesis  that  the  chemically  activated  condition  of  the 
molecule,  at  least  in  certain  cases,  might  be  one  in  which  the  amplitude 
of  atomic  oscillation  has  become  so  great  that  the  forces  holding  an 
oscillating  atom  in  to  the  molecule  are  greatly  weakened  at  the  exterior 
limit  of  its  motion. 
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This  hypothesis,  however,  immediately  leads  to  an  important  con- 
sequence. If  the  oscillations  become  so  large  that  the  restoring  forces 
at  the  exterior  limit  of  oscillation  are  small,  it  is  evident  that  the  fre- 
quency of  atomic  oscillation  will  also  become  small  in  the  same  way  that 
the  frequency  of  oscillation  of  any  elastic  system  decreases  as  it  is 
strained  beyond  its  elastic  limit.  For  any  conditionally  periodic  system 
of  periodicity  s,  however,  the  energy  E,  fundamental  frequencies  coi .  . . .  a>« 
and  phase  integrals  Ii  ....  Is  are  connected  by  the  classical  equation 

dE  =  oj.dh  +  ....  +  cosdis  (628) 

Hence,  for  any  oscillation  in  which  a  frequency,  say  cor,  has  become  small, 
the  rate  of  increase  of  energy  E  with  the  corresponding  phase  integral 
Ir  will  be  small  in  accordance  with  the  equation 

dE 

if  =  COr  (629) 

air 

Successive  quantized  states  of  the  system,  however,  are  those  in  which 
the  phase  integrals  differ  by  successive  amounts  h.  It  is  immediately  evi- 
dent that  the  energy  levels  of  successive  quantized  states  will  lie  near 
together  when  the  frequency  involved  becomes  small.  The  close  suc- 
cession of  quantized  states  for  a  Planck  oscillator  of  small  frequency  is 
of  course  a  special  case  of  the  above  more  general  result.  In  other 
words,  as  soon  as  the  amplitude  of  atomic  oscillation  within  a  mole- 
cule has  become  large  enough  so  that  the  bond  holding  the  atom  in  ques- 
tion has  become  greatly  weakened,  the  successive  quantized  states  of 
the  molecule,  that  is,  the  activated  states  of  the  molecule,  will  have  energy 
levels  lying  near  together.  Indeed,  if  the  oscillations  become  so  large 
that  the  force  restraining  the  atom  at  the  exterior  limit  of  its  motion 
approaches  zero,  we  obtain  a  continuous  series  of  activated  states.  Such 
a  complete  weakening  of  the  bond  might  be  necessary  for  certain  re- 
actions and  would  seem  especially  reasonable  in  the  case  of  unimolecular 
changes. 

343.  If  we  proceed  on  the  suggested  basis  and  assume  a  series  of 
activated  states,  it  is  now  evident  that  the  rate  of  activation  will  be 
given  by  a  summation  of  the  form 

£L  „     Snhvi  " 


.,,  ^^Npie-~kf  Bia ^    hii^       1 

f^A^act.   _     i  a  C^  ekT    —  1 

lit  " 

^^  ^Pie-kf 


(630) 


where  Bia  is  Einstein's  coefficient  of  absorption  for  radiation  of  the  fre- 
quency necessary   for  transition   from  some  particular  unactive  state  i 
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to  a  particular  chemically  active  state  a,  and  the  double  summation  is 
to  be  taken  over  all  such  pairs.  The  summation  could  of  course  be  re- 
placed by  integration  in  the  case  of  a  continuous  series  of  activated 
states.  Whether  the  increased  rate  of  activation  vi^hich  equation  (630) 
would  give  as  compared  v^ith  the  earlier  equation  (627)  would  be  suffi- 
cient to  account  for  the  observed  rate  of  reaction  in  any  special  case 
would  of  course  be  dependent  on  molecular  properties  of  the  substance 
involved  which  are  not  now  at  our  disposal. 

344.  This  elaborated  radiation  theory  can  also  be  formulated  in  a 
somewhat  different  manner.  On  the  basis  of  this  theory  thermal  reaction 
is  to  be  looked  upon  as  the  combined  result  of  all  the  photochemical 
reactions  going  on  in  the  system  because  of  the  thermal  radiation.  Hence 
li  k"  is  the  specific  photochemical  rate  of  reaction  corresponding  to  the 
frequency  v,  and  we  assume  that  the  thermal  reaction  is  due  to  the  thermal 
radiation  present  in  the  reaction  mixture  with  the  energy  density  given 
by  the  Planck  radiation  law,  we  may  evidently  write  for  the  specific 
thermal  rate  ^^  of  a  unimolecular  reaction  the  expression 

0  c  A  y  —  1 

This  expression  was  first  published  by  the  author  ^^  in  1920  and  evidently 
gives  the  desired  extension  to  the  simple  radiation  theory  of  reaction. 

Although  equation  (631)  gives  an  undoubted  improvement  in  the 
formulation  of  the  radiation  theory  of  reaction,  nevertheless  the  main 
part  of  our  task  would  be  to  determine  whether  this  equation  actually 
leads  to  rapid  enough  rates  to  account  for  known  reaction  velocities, 
thus  rescuing  the  radiation  theory  of  chemical  reaction  from  its  main 
difficulty.  Unfortunately,  the  answer  to  this  question  cannot  now  be 
given,  since  it  would  involve  a  quantitative  knowledge  of  specific  photo- 
chemical reaction  rates,  which  we  do  not  possess.  Certainly  the  burden 
of  proof  must  rest  on  those  who  would  maintain  the  applicability  of 
equation  (631).  The  difficulties  in  subjecting  this  equation  to  a  proper 
test  should  not  be  overlooked.  The  effect  of  a  wide  range  of  fre- 
quencies must  be  studied,  since  of  course  the  actual  distribution  of  active 
wave-lengths  cannot  be  predicted.  In  particular  it  is  to  be  remembered, 
in  accordance  with  the  discussion  of  the  preceding  sections,  that  activa- 
tion may  occur  in  steps  and  the  last  steps  of  the  activation  process  may 
involve  very  small  energy  increases,  so  that  the  far  infra-red  as  well 
as  the  near  infra-red  must  be  searched.  Finally,  it  must  not  be  for- 
gotten that  k^   is  the  specific  photochemical  reaction  rate  at  the  tcmpera- 

^'Tolman,  Journ.  Amer.  Chem.  Soc,  42,  2506  (1920). 
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ture  in  question  in  the  presence  of  the  thermal  radiation  of  all  the  other 
frequencies.  Hence,  since  many  photochemical  reactions  have  appreciable 
temperature  coefficients,  we  must  determine  directly  or  indirectly  the  in- 
crease in  reaction  rate  when  we  shine  light  of  the  different  wave-lengths 
into  the  reaction  mixture  at  the  reacting  temperature. 

345.  To  sum  up  the  results  of  this  treatment,  we  may  state  that  there 
are  experimental  and  theoretical  reasons  for  believing  in  the  possibility  of  a 
series  of  activated  states  of  nearly  continuously  increasing  energy  con- 
tent, rather  than  in  a  single  activated  state.  It  cannot  be  said  at  the 
present  time,  however,  whether  this  leads  to  a  sufficient  increase  in  rates 
of  activation  by  absorption  of  radiant  energy  to  account  for  known  reaction 
velocities. 

346.  Concluding  Remarks. — The  discussion  of  the  temperature  co- 
efficient of  reaction  rate  given  in  the  foregoing  chapter  has  concerned 
itself  with  the  high  coefficients  found  in  typical  homogeneous  gas  reactions. 

Reactions  such  as  the  recombination  of  bromine  atoms  to  form  the 
molecule  which  appears  to  have  no  appreciable  temperature  coefficient, 
present  of  course  no  difficulty,  since  we  merely  have  to  assume  that  a 
certain  fraction  of  the  collisions  between  unactivated  bromine  atoms  lead 
to  recombination  with  an  emission  of  radiation  to  remove  the  excess 
energy,  the  process  thus  being  the  reverse  of  the  dissociation  of  bromine 
by  radiation. 

As  for  heterogeneous  reactions,  as  we  saw  in  the  last  chapter  the 
temperature  coefficient  may  in  some  cases  be  dependent  on  the  change 
with  temperature  in  the  fraction  of  the  surface  covered  with  an  adsorbed 
film.  On  the  other  hand,  changes  with  temperature  in  rate  of  diffusion 
and  actual  chance  of  reaction  will  have  to  be  considered  when  the  accumu- 
lation of  experimental  data  warrant  a  more  complete  theory. 

347.  The  theory  presented  in  this  chapter  has  all  been  based  on  the 
Arrhenius  picture  of  activation  as  the  mechanism  of  chemical  reactions 
that  have  large  temperature  coefficients. 

A  considerable  loophole  in  the  theory  consists  in  the  fact  that  in- 
dependent evidence  for  the  existence  of  these  activated  states  is  mostly 
lacking.  In  this  connection,  however,  the  considerations  of  Wulf  ^^ 
giving  some  evidence  for  their  presence  in  the  thermal  decomposition 
of  ozone  should  be  noted.  Furthermore,  we  have  the  certainty  that 
molecules  are  chemically  reactive  when  they  have  been  excited  by  the 
absorption  of  radiation  in  photochemical  experiments,  or  excited  by  col- 
lisions of  the  second  kind  in  experiments  on  induced  photochemical  effects, 

"Wulf,  Proc.  Nat.  Acad.,  12,  129  (1926). 
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or  excited  by  collision  with  fast  moving  particles  shot  in  from  radioactive 
sources. 

348.  The  worst  difficulty,  however,  which  has  been  encountered  by 
the  Arrhenius  picture  is,  as  we  have  seen,  that  of  finding  mechanisms  of 
activation  which  would  act  rapidly  enough  to  keep  up  with  known  rates 
of  reaction. 

In  this  connection,  neverthless,  we  must  call  attention  to  a  certainly 
demonstrated  and  definite  difficulty  of  the  same  kind  in  connection  with 
the  famous  Stern  and  Gerlach  "  experiment  on  the  orientation  of  silver 
atoms  in  a  magnetic  field.  In  this  experiment  a  ray  of  vaporized  silver 
atoms  is  shot  between  the  poles  of  a  strong  magnet,  the  atoms  remain- 
ing in  the  magnetic  field  only  for  a  time  of  the  order  of  10"*  seconds. 
Nevertheless,  in  this  short  time  interval  all  the  atoms  orient  themselves 
so  that  their  own  magnetic  axis  is  parallel  to  the  field  and,  most  remark- 
able of  all,  approximately  half  the  atoms  turn  in  the  direction  of  lining 
up  with  the  field  and  the  other  half  in  the  opposite  direction.  Hence 
half  the  atoms  receive  energy  at  a  rate  far  greater  than  we  can  account 
for  on  the  basis  of  any  known  mechanisms  of  energy  transfer. 

A  similar  difficulty  is  possible  in  the  case  of  the  decomposition  of 
nitrogen  tetroxide  to  form  the  dioxide.  This  reaction  appears  from  experi- 
ments on  the  velocity  of  sound  through  a  mixture  of  the  two  gases  to 
take  place  at  a  rate  so  fast  as  not  to  be  tneasurable.^^  Hence,  here,  too, 
we  must  inquire  into  the  mechanism  by  which  the  tetroxide  molecules 
can  acquire  energy  fast  enough  to  dissociate  in  a  time  which  is  short  com- 
pared with  the  period  of  the  sound  pulsations.  It  is  to  be  specially  noted 
in  this  case,  moreover,  that  here  we  are  dealing,  not  with  a  hypothetical 
energy  of  activation,  but  with  the  known  energy  necessary  for  the  dis- 
sociation of  the  tetroxide  molecule. 

349.  These  remarks  as  to  difficulties  in  other  cases  of  accounting  for 
the  rapidity  with  which  energy  is  received  by  atoms  or  molecules  may 
indicate  the  necessity  of  a  fundamental  modification  in  our  ideas  as  to 
the  conservation  or  transfer  of  energy.  Two  possibilities  would  seem  to 
be  open. 

First  we  might  give  up  the  conservation  of  energy  except  as  a  sta- 
tistical principle.  Then  we  should  never  have  to  inquire  where  the 
energy  comes  from  when  a  molecule  is  excited,  but  should  merely  have 
to  balance  our  energy  account  in  the  long  run.  In  the  case  of  the  Stern 
and  Gerlach  experiment,  since  approximately  half  the  silver  atoms  re- 
ceive energy  and  the  other  half  lose  it,  this  balance  is  immediately  made. 
In  the  case  of  the  passage  of  sound  waves  through  a  mixture  of  nitrogen 

"  Stern  and  Gerlach,  Zeitschr.  fiir  Physik.,  9,  349  (1922). 
"Griineisen  and  Goens,  Ann.  der  Phys.,  72,  193  (1923). 
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tetroxide  and  dioxide,  the  balance  is  made  at  the  end  of  every  complete 
period  of  the  wave.  And  finally  to  balance  our  account  in  the  case  of 
chemical  reactions  taking  place  through  activation,  w^e  should  not  have  to 
supply  the  whole  calculated  energy  of  activation,  but  only  the  much  smaller 
positive  or  negative  energy  actually  necessary  for  the  reaction  itself,  since 
in  endothermic  reactions  most  of  the  energy  of  activation  is  immediately 
again  made  available  by  the  deactivation  of  the  products,  and  in  exo- 
thermic reactions  there  is  even  energy  to  spare. 

A  second  possibility  would  be  always  to  assume  the  conservation  of 
energy  even  for  individual  molecular  processes,  but  to  postulate  when 
necessary  a  very  special  coupling  by  which  energy  disappearing  from 
one  molecule  could  reappear  in  another.  Thus  in  the  Stern  and  Gerlach 
experiment,  eveiy  silver  atom  that  turned  with  the  field  would  be 
"coupled"  with  one  which  turned  against  the  field  and  would  supply  to  it 
the  necessary  energy.  In  the  passage  of  a  sound  wave  through  the 
mixture  of  tetroxide  and  dioxide,  each  molecule  of  tetroxide  dissociat- 
ing in  region  of  low  pressure  could  receive  its  energy  from  a  molecule 
recombining  in  a  region  of  high  pressure.  And  in  the  case  of  chemical 
reactions  taking  place  through  activation,  a  molecule  to  be  activated 
would  draw  its  supply  directly  from  that  made  available  by  the  completion 
of  the  reaction  undergone  by  some  other  molecule.  Such  immediate  trans- 
fers, however,  would  of  course  take  place  in  ways  not  now  known  to  us. 

Of  course  it  may  not  be  necessary  to  introduce  hypotheses  of  such  a 
kind  in  order  to  preserve  the  Arrhenius  hypothesis  of  chemical  reaction 
through  activation.  On  the  other  hand,  it  must  of  course  be  emphasized 
that  the  picture  itself  may  be  wrong.  These  are  among  the  most  impor- 
tant questions  for  future  chemistry. 


Chapter  22. 
Photochemical  Reactions. 

350.  Introduction, — In  the  present  chapter  we  shall  consider  the 
yield,  rate,  and  change  of  rate  with  temperature  in  the  case  of  photo- 
chemical reactions.  Theoretically  at  least  these  should  be  simpler  than 
thermal  reactions,  since  we  have  definite  information  as  to  the  nature 
of  the  agency  responsible  for  the  action.  In  practise,  however,  the  radia- 
tion absorbed  may  often  start  a  complicated  chain  of  events  so  that  the 
different  factors  in  the  final  result  are  hard  to  disentangle.  In  addition, 
with  the  experimental  methods  usually  employed,  complications  are  in- 
troduced by  the  fact  that  a  range  of  frequencies  instead  of  monochromatic 
radiation  is  often  used  and  the  density  of  the  radiation  is  usually  unknown 
and  different  in  different  parts  of  the  illuminated  mixture. 

We  shall  content  ourselves  with  the  theoretical  consideration  of  a  few 
cases  which  are  perhaps  artificially  oversimplified.  Nevertheless,  the 
treatment  will  indicate  the  nature  of  the  methods  to  be  employed  in 
analyzing  more  complicated  experimental  results  into  the  individual  proc- 
esses in  terms  of  which  they  must  be  explained. 

351.  Einstein's  Law  of  Photochemical  Equivalence. — It  is  evi- 
dent that  the  primary  process  in  any  photochemical  reaction  must  be 
the  absorption  of  a  quantum  of  radiation  by  a  molecule  in  some  lower 
quantum  state  i  to  go  over  to  a  higher  state  a,  as  indicated  in  reaction 
form  by  the  expression 

hv-\-Ai  -^  Aa  (632) 

In  case  complications  do  not  arise,  we  can  then  expect  each  mole- 
cule so  activated  to  enter  into  reaction^  so  that  the  total  extent  of  the 
chemical  change  will  be  directly  proportional  to  the  number  of  quanta 
absorbed.  This  is  Einstein's  lazv  of  photochemical  equivalence  which 
can  be  expressed  in  the  form 

Q  =  n^^  (633) 

where  Q  is  the  number  of  molecules  reacting,  E  is  the  radiant  energy 
absorbed,  and  n  is  some  small  integer  which  gives  the  number  of  mole- 
cules whose  chemical  change  is  brought  about  by  a  single  process  of  activa- 
tion.    Tests  of  this  law  are  of  course  to  be  made  by  measuring  the 

286 
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intensities  of  the  incident  and  transmitted  radiation  in  a  photochemical 
experiment  and  comparing  the  energy  absorbed  with  the  extent  to  which 
the  reaction  proceeds. 

352.  Examples  of  Einstein's  Law. — This  law  of  photochemical 
ecjuivalence  is  found  to  be  obeyed  in  the  case  of  a  considerable  number 
of  reactions,  and  may  be  regarded  as  the  ideal  state  of  afifairs  from  which 
deviations  are  to  be  reckoned. 

As  a  simple  example  of  the  law  we  may  consider  the  decomposition 
of  gaseous  hydrogen  bromide  by  light,  a  reaction  which  has  been  studied 
by  Warburg.^  With  radiation  of  wave-length  2090  A.  and  2530  A.  it 
was  found  within  the  experimental  error  that  two  molecules  of  hydrogen 
bromide  were  decomposed  for  each  quantum  absorbed. 

This  result  might  be  explained  by  either  of  the  mechanisms 

hv  +  HBr  ->  H  +  Br 
H  +  HBr  -»  H2  +  Br 
2Br  -^  Bro 
or 

hv  +  HBr  ^    [HBr] 
[HBr]  +  HBr  -^  H,  +  Bro 

where  the  brackets  indicate  an  activated  form  of  the  molecule.  The 
energy  contained  in  one  quantum  at  the  frequencies  used  was  as  a  matter 
of  fact  sufficiently  great  to  dissociate  the  molecule.  Nevertheless,  mole- 
cules are  known  to  have  electron  jumps  which  absorb  more  than  the 
energy  of  dissociation,  without  the  atoms  separating.  Hence  a  decision 
between  the  two  mechanisms  on  this  basis  alone  would  not  be  possible. 
Similar  results  were  also  found  by  Warburg  for  hydrogen  iodide. 

As  another  example  of  the  law,  this  time  involving  two  reacting  sub- 
stances, we  may  consider  the  photochemical  reaction  between  chlorine 
and  trichlor-brom-methane  under  the  influence  of  light  of  wave-lengths 
which  are  absorbed  by  the  chlorine.  In  this  reaction  one  molecule  of 
chlorine  and  two  of  the  trichlor-brom-methane  are  found  to  react  for 
each  quantum  absorbed,  the  yield  being  independent  of  the  concentration 
of  the  latter  substance  over  a  wide  range. ^ 

The  mechanism  of  the  reaction  might  be  pictured  as  follows: 

hv  +  CX^  -^    [CI2] 
[CL]  +  CClsBr  -^  CCI4  -f  CI  +  Br 
CI  -f  CCUBr  ->  CCI4  -\-  Br 
2Br  -^  Bra 

^Warburg,  Sitz.-hcr.  prctiss.  Akad.,  p.  314  (1916). 
'Gruss,  Zeitsckr.  fur  Elektrochem.,  29,  144  (1923). 
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Since  the  yield  remains  constant  even  when  the  concentration  of  trichlor- 
brom-methane  is  very  considerably  reduced,  we  must  ascribe  a  long 
enough  life  to  the  activated  chlorine  molecule  [CI2]  to  make  sure  that 
it  will  collide  with  a  trichlor-brom-methane  molecule  before  it  has  lost  its 
activation.  In  the  experiments  under  discussion  it  was  merely  possible 
to  conclude  that  the  mean  life  was  greater  than  2  X  10"^  seconds.  This 
of  course  is  entirely  reasonable.     (See  Chapter  16.) 

353.    Einstein's  Law  in  Cases  of  Photochemical  Sensitization.. — 

There  is  of  course  no  reason  why  the  molecule  which  actually  absorbs 
the  radiation  should  itself  be  one  of  those  entering  into  chemical  re- 
action. By  collisions  of  the  second  kind  ^  it  can  transmit  its  energy 
to  some  other  kind  of  molecule  which  reacts  chemically.  This  explains 
the  great  field  of  photochemical  reactions  which  only  occur  in  the  presence 
of  a  suitable  added  substance  which  can  absorb  the  radiation  employed. 
They  are  said  to  be  photosensitized  by  the  addition  of  this  photocatalyst. 

Here,  too,  in  cases  which  are  not  complicated  by  secondary  factors, 
we  should  expect  Einstein's  law  of  photochemcial  equivalence  to  hold, 
since  each  molecule  of  the  photocatalyst  activated  will  pass  on  its  energy 
to  a  molecule  of  reactant. 

As  an  example  of  such  a  photosensitized  process,  we  may  consider 
the  reaction  between  oxygen  and  trichlor-brom-methane,  which  has  been 
found  to  take  place  when  sensitized  by  bromine  under  the  action  of 
light  of  wave-lengths  4360  and  4070  A.  which  are  absorbed  by  bromine. 
Since  bromine  is  itself  produced  in  the  process,  the  reaction  is  auto- 
catalytic.  Two  molecules  of  the  trichlor-brom-methane  are  found  to  be 
used  up  per  quantum  absorbed,  and  the  yield  is  found  over  a  wide 
range  to  be  independent  of  the  concentration  both  of  oxygen  and  trichlor- 
brom-methane.^  The  course  of  the  reaction  can  be  indicated  as  follows : 
where  it  seems  wisest  not  to  try  to  specify  the  complete  mechanism  of 
the  second  step,  which  presumably  involves  the  deactivation  of  the  excited 
bromine  by  a  collision  of  the  second  kind.  The  lack  of  dependence  of 
yield  on  the  concentration  of  the  reactants  sets  a  limit  of  the  order  10"'' 
seconds  for  the  life  of  the  activated  state. 

Another  example  of  photosensitization  is  given  by  the  decomposition 
of  ozone  in  the  presence  of  chlorine.^     It  is  found  that  two  molecules 

hv-^Br2  ->  [Bra] 

[Bra]  +  {2CCl3Br  -f  O2 }  -»  Br,  +  2COCI2  -f  CL  -f  Bra 

'  The  activated  molecule  of  the  photocatalyst  might  also  be  able  to  enter  into 
temporary  reaction  with  the  reactant.  The  formulation  would  give  the  same  final 
result  as  that  using  the  assumption  of  collisions  of  the  second  kind. 

■*  Gruss,  loc.  cit. 

•Bonhoeffer,  Zeitschr.  fur  Phys.,  13,  94  (1923). 
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of  ozone  decompose  for  every  molecule  of  chlorine  activated  by  absorp- 
tion. Since  the  yield  is  independent  of  the  concentration  of  the  ozone 
betv^^een  0.1  to  5  per  cent,  it  was  possible  to  calculate  a  minimum  life 
of  the  excited  chlorine  of  the  order  of  1.6  X  lO"'^  seconds,  and  to  con- 
clude that  the  excited  chlorine  could  collide  with  as  many  as  a  thousand 
molecules  of  oxygen  without  deactivation.     (See  Section  330,  Chapter  21.) 

354.  Deviations  from  Einstein's  Law. — Many  photochemical  re- 
actions deviate  from  the  law  of  photochemical  equivalence.  This,  how- 
ever, is  itself  to  be  expected  on  the  basis  of  the  very  hypothesis  that 
led  to  Einstein's  law.  Although  the  primary  process  of  absorption  will 
always  lead  to  the  activation  of  one  molecule  per  quantum  of  energy 
abstracted  from  the  radiation,  a  molecule  thus  activated  may  not  always 
succeed  in  entering  the  chemical  change  that  follows  before  it  loses 
its  energy  by  reemission  or  collision  of  the  second  kind.  In  such  cases 
the  yield  will  fall  below  that  predicted  by  the  law.  On  the  other  hand, 
molecules  which  have  been  activated  by  the  absorption  of  radiation  may 
be  the  starting  point  for  a  chain  of  events  involving  many  further  mole- 
cules depending  on  their  concentration.  In  this  case  we  shall  find  devia- 
tions from  the  law  of  photochemical  equivalence  in  the  direction  of  more 
reaction  than  would  be  predicted. 

As  an  example  of  a  reaction  in  which  the  change  is  less  than  would 
be  expected  on  the  basis  of  the  Einstein  law,  we  may  take  the  decomposi- 
tion of  ammonia  by  radiation  in  the  region  2030  to  2140  A.,  where  it 
has  been  found  by  Warburg  ^  that  approximately  four  quanta  are  ab- 
sorbed for  every  molecule  of  ammonia  decomposed. 

As  a  famous  example  of  a  reaction  in  which  more  molecules  are 
decomposed  than  would  be  predicted  by  the  law  of  equivalence,  we 
may  take  the  combination  of  chlorine  and  hydrogen  which  has  long  been 
known  to  take  place  under  the  action  of  sunlight.  In  this  reaction  the 
number  of  molecules  reacting  may  be  as  large  as  10*^  times  the  number 
of  quanta  absorbed.  It  would  be  beyond  the  scope  of  our  present  in- 
terests to  discuss  the  conflicting  mechanisms  which  have  been  proposed 
as  giving  the  specific  steps  of  the  chain  of  processes  which  must  explain 
such  a  result. 

355.  Photochemical  Reaction  Rate. — So  far  we  have  been  discuss- 
ing the  yield  of  photochemical  reactions,  based  on  a  comparison  of  the 
number  of  quanta  actually  absorbed  in  a  given  length  of  time  with  the 
amount  of  chemical  change  which  results.  We  must  now  turn  our  atten- 
tion to  the  rate  at  which  the  reactions  proceed,  with  a  given  intensity 
of  illumination  and  a  given  concentration  of  reactants. 

•Warburg,  Sits.-ber.  preuss.  Akad.,  p.  746  (1911)  ;  p.  216  (1912). 
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In  analogy  with  the  use  of  the  term  specific  reaction  rate  in  ordinary 
thermal  reactions,  we  shall  find  it  convenient  to  define  the  specific  photo- 
chemical reaction  rate  K  for  radiation  of  a  given  frequency  v  as  the 
rate  at  which  the  reaction  proceeds,  with  unit  concentration  of  the  react- 
ing substances  present,  and  in  the  presence  of  radiation  from  an  external 
source  of  frequency  v  and  unit  radiation  density  u„  =  1. 

The  actual  rate  of  course  under  any  given  experimental  conditions 
will  be  dependent  on  the  radiation  density  actually  present  and  on  the 
concentrations  of  the  reacting  substances  employed.  To  express  this  fact 
we  may  when  convenient  employ  an  order  equation,  which  can  be  written 
in  the  form 

-  ^^  =  k,  u.iN^YiNsY  . . .  (634) 

where   the  concentrations   of   the   reacting  substances   N a,  N  b,  etc.,  are 
preferably  expressed  in  numbers  of  molecules  per  cubic  centimeter. 

356.  In  this  equation  the  radiation  density  ti^  enters  to  the  first 
power,  since  so  far  few  if  any  reactions  have  been  found  in  which  the 
rate  of  the  change  is  not  proportional  to  the  intensity  of  illumination. 
Furthermore,  such  proportionality  is  to  be  theoretically  expected  unless 
the  intensity  of  the  illumination  is  raised  so  high  as  to  lead  to  the  partial 
exhaustion  of  molecules  in  the  state  which  is  actually  absorbing  the  radia- 
tion, or  unless  more  than  one  molecule  has  to  be  activated  in  order  for 
the  reaction  to  proceed. 

357.  As  to  the  dependence  of  the  rate  on  concentrations,  in  the  case 
of  a  reaction  which  obeys  the  law  of  photochemical  equivalence,  the  rate 
will  be  dependent  only  on  the  concentration  of  the  substance  which 
absorbs  the  radiation.  The  rate  will  furthermore  usually  be  nearly  propor- 
tional to  the  first  power  of  this  concentration,  unless  there  are  large 
deviations  from  Beer's  law  of  proportionality  between  absorption  and 
concentration.  In  cases  where  the  law  of  photochemical  equivalence  fails, 
the  rate  may  depend  on  the  concentrations  of  more  than  one  of  the  react- 
ing substances  and  in  a  variety  of  ways.  Thus  for  the  reaction  between 
hydrogen  and  chlorine  in  the  presence  of  oxygen  which  acts  as  an 
inhibitor  Chapman  ^  finds  that  the  rate  can  be  expressed  by  an  equation  of 
the  form 

rf(HCl)  _  hjUoJiCUy 

dt     -'''k,{\i,y-^{0,)-\-{C\,) 

where  x  varies  over  a  small  range  of  the  order  zero  to  one  half,  with 
the  ratio  of  the  concentrations  of  hydrogen  and  chlorine. 

'M.  C.  C.  Chapman,  Journ.  Chem.  Soc,  123,  3062  (1923). 
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358.  In  determining  the  value  of  the  specific  photochemical  reaction 
rate  from  actual  experimental  data,  it  is  to  be  recognized  that  u,  is  the 
radiation  density  of  frequency  v  in  excess  of  that  prescribed  by  the  Planck 
radiation  law.  The  actual  value  of  tiy  will  of  course  vary  from  point  to 
point  in  a  mixture  experimentally  illuminated  from  an  external  source. 
It  is  related  to  the  intensity  of  illumination  h  at  any  point  by  the 
equation 

M;,  =—  (635) 

where  c  is  the  velocity  of  light,  and  h  falls  off  in  a  homogeneous  mix- 
ture exponentially  with  the  distance  .r  in  the  familiar  way 

/,  =  (/,)o^-«.^  (636) 

Furthermore,  in  determining  the  value  of  photochemical  reaction  rates, 
it  is  evident  that  the  total  observed  rate  of  change  must  be  corrected 
when  necessary  by  subtracting  the  rate  of  change  due  to  ordinary  thermal 
reaction. 

359.  Theoretical  Expression  for  a  Unimolecular  Photochemical 
Rate. — With  the  help  of  our  knowledge  of  the  mechanism  of  radiation 
absorption,  it  will  now  be  easy  to  obtain  a  theoretical  expression  for  the 
rate  of  unimolecular  photochemical  change. 

Let  Bi^  be  the  chance  per  second  that  a  molecule  in  the  tth.  quantum 
state,  surrounded  by  unit  radiation  density  of  frequency  v  will  absorb  a 
quantum  of  energy  liv  and  hence  go  over  into  an  excited  state. 

Bi„  is  thus  the  value  of  Einstein's  coefficient  of  absorption  for  the 
transition  in  question,  and  with  its  help  we  can  of  course  calculate  the 
number  of  molecules  activated  per  second  and  hence  the  number  that 
would  react  if  every  molecule  activated  should  undergo  the  chemical 
change.  In  order,  however,  to  allow  for  the  possibility  that  some  mole- 
cules, through  deactivation,  fail  to  react  we  shall  take  ^i„  as  the  chance 
of  reaction  for  a  molecule  in  the  excited  state  of  energy  content  £»  -j-  hv. 
Hence  for  the  rate  of  reaction,  due  to  molecules  in  state  i,  we  obtain 

Q-  =  BiAi.Si.Ni  (637) 

Since  molecules  in  more  than  one  quantum  state  may  absorb  the  fre- 
quency V  and  react  ^  we  can  obtain  the  total  rate  of  reaction  by  summing 
over  all  states  i  Doing  this  and  substituting  the  Maxwell-Boltzmann 
expression  for  the  number  of  molecules  in  any  state  i  we  obtain  for  the 
total  rate  of  photochemical  reaction 

*  This  will  be  true  in  the  case  of  a  continuous  series  of  activated  states. 
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£1 

--&  =  ^^- ^-^  (638) 

"^^  -2  Pi  e  -kf 

i 

or  for  the  specific  rate 

2  BiySivpie    kT 

k^=- 377—  (639) 

2  pie     kT 

i 

360.  Lacking  knowledge  of  the  factors  Bi^  and  Si^  we  cannot  of 
course  use  this  expression  for  the  prediction  of  actual  photochemical 
rates,  but  by  differentiation  with  respect  to  the  temperature  we  can 
acquire  information  with  respect  to  the  temperature  coefficient  of  photo- 
chemical reaction  rate. 

Taking  a  logarithmic  differentiation  with  respect  to  T  we  obtain  ^ 


d  log  K  _  1  f^^--^^-^^     ^-^^      ^P^'     '-'  jh 


dT  ky  V.     -—  V.     -- 

Zipie        kT  2.  pie        kT 


(640) 


The  first  term  on  the  right-hand  side  of  this  equation  is  evidently  the 

average   value   of   r^    for  the  molecules   that  actually   react,   in   their 

states  before  activation,  and  the  second  term  the  average  value  for  all  the 
molecules,  so  that  we  can  write 

^  log  fe,  _  g  —  £  .    .. . 

dT      ~    kT''  ^     ^ 

where  the  double  line  indicates  an  average  taken  for  the  molecules  that 
enter  into  reaction  and  the  single  line  an  average  for  all  the  molecules. 
We  have  thus  obtained  a  very  simple  equation  for  temperature  coefficient 
of  photochemical  reaction  rate,  which  we  shall  later  interpret. 

361.  Theoretical  Expression  for  a  Bimolecular  Photochemical 
Rate. — Let  us  next  consider  a  reaction  between  substances  of  two  kinds, 
having  respectively  N  and  N'  molecules  present  per  cubic  centimeter, 
and  let  us  assume  that  reaction  is  brought  about  by  the  photochemical 
activation  of  the  first  of  the  two  substances. 

*  In  the  differentiation  we  have  taken  siv  as  independent  of  the  temperature. 
There  could  be  a  small  tendency  for  sip  to  decrease  with  temperature,  owing  to 
more  frequent  deactivations  without  reaction.     See  Section  317,  Chapter  21. 
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If  Biv  is  the  value  of  Einstein's  coefficient  of  absorption  for  mole- 
cules of  the  first  kind  in  any  quantum  state  i,  we  can  then  write  for  the 
rate  at  which  these  molecules  are  being  activated 

Mir  ^  u^Bi^Ni  (642) 

The  activated  molecules  so  formed  will  be  removed  partly  by  reaction 
and,  in  case  the  law  of  photochemical  equivalence  does  not  hold,  partly 
by  deactivation.  For  their  rate  of  removal  by  reaction  we  can  evidently 
write 

dNi.  =  T/^^ki,jNiMj'  (643) 


dt 

where  T'^ki^,-  is  the  chance  of  collision  and  reaction  between  a  molecule 
of  the  first  kind  in  the  state  iv  and  a  molecule  of  the  second  kind  in  the 
state  y.  The  rate  of  deactivation  can  be  put  proportional  to  the  number 
of  molecules  in  the  state  iv 


dNu, 
'  dt 


=  ai.Nu  (644) 


where  a^^  is  the  sum  of  the  coefficients  of  deactivation  to  all  lower  states. 
(See  Chapter  21,  Section  314.) 

For  the  steady  state  of  reaction,  since  the  number  of  molecules  in 
the  activated  state  iv  is  always  small,  the  rate  of  formation  as  given  by 
equation  (642)  must  be  equal  to  the  total  rate  of  removal  as  given  by 
equations  (643)  and  (644),  so  that  we  can  combine  these  equations  and 
solve  for  the  concentration  of  activated  molecules  Ni^.    We  obtain 

The  total  rate  of  the  reaction  can  now  be  obtained  by  resubstituting  this 
expression  into  (643)  and  summing  up  for  all  states  i.  We  thus  obtain 
for  the  rate  of  reaction 

^^  /r^2  ki,jN/u,Bi,Ni\ 

~W^^  [  vh  ki^jN/  +  Gi,  J  ^^"^^^ 

362.  In  case  the  chance  for  an  activated  molecule  to  react  is  large 
compared  with  the  chance  of  deactivation,  the  second  term  in  the  de- 
nominator will  be  small  compared  with  the  first,  and  the  expression  will 
reduce  to 
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^'t£L=^  u.B,.N,  =  -^ N  (647) 

^piC       kT 
i 

The  rate  will  then  be  dependent  only  on  the  concentration  A'"  of  mole- 
cules of  the  first  kind,  and  every  molecule  that  is  activated  will  react  in 
accordance  with  the  law  of  photochemical  equivalence. 

On  the  other  hand,  in  the  extreme  case  that  the  chance  of  reaction  is 
very  small  compared  with  that  of  deactivation,  equation  (646)  will  assume 
the  form 

-~  =  ^.r^2  (^^  BiM.N/) 
at  i    \  i    <^i»  / 

2^^  Bi,Pie~-rp/e-Tr) 

=  u.i-i.i.  /    ""''      ^  jjr^  (648) 

i^Pie-kf\  i^p/e-kf\ 

so  that  the  rate  becomes  proportional  to  both  concentrations.  Since  the 
relative  chance  of  reaction  to  deactivation  can  be  increased  by  increas- 
ing the  concentration  N',  we  may  expect  to  encounter  order  equations 
of  form  intermediate  between  (647)  and  (648)  as  we  vary  the  concentra- 
tion, as  well  as  different  variations  from  the  yield  calculated  from  the 
law  of  photochemical  equivalence. 

363.  The  general  treatment  of  the  temperature  coefficient  of  a  re- 
action such  as  we  are  considering  could  be  obtained  by  differentiating 
the  general  equation  (646)  with  respect  to  T,  but  the  result  is  pretty  com- 
plicated. It  will  be  sufficient  for  our  cases  to  consider  the  two  limiting 
cases  given  by  equations  (647)  and  (648). 

Referring  to  equation  (647)  and  taking  the  specific  reaction  rate  /v\, 
which  is  the  factor  by  which  u„N  must  be  multiplied  to  get  the  actual 
rate,  we  easily  obtain  by  logarithmic  differentiation  ^°  just  as  in  the 
derivation  of  equation  (641) 

where  e  is  the  average  energy  of  the  molecules  of  the  photoactive  sub- 
stance that  pick  up  radiation  and  react,  and  8  is  the  average  energy  of  all 
these  molecules. 

^"The  change  of  o.iv  with  temperature  is  neglected,  see  Section  317,  Chapter  21. 
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Using  the  value  of  k^  coming    from   equation    (648)    we   obtain  by 
logarithmic  differentiation  with  respect  to  T  ^° 


d  log  k^ 


1  r/^SS  ^B,.p,e  -AT  ^  p/e-  ff 


dT      ~  2T 


=  -^  + 


k%  (  HpiC     kT  \  I  ^p/e     It  \ 

.  i  j  ai„ kT2 

k,  (  YpiC ~^)( ^P/e  ~^) 


^piC        kT  ^pj'e~kf 


(650) 


] 


But  by  the  principles  of  averaging,  so  often  used  before,  this  evidently 
becomes 

c^logA",  _     1  e  — £  +  £^  —  £^ 

dT      ~  2T  ^  kT^  ^     ^ 

where  e  is  the  average  energy  of  the  molecules  of  the  photoactive  sub- 
stance that  pick  up  radiation  and  react  and  e  the  average  energy  of  all 
the  molecules  of  photoactive  substance,  while  e'  is  the  average  energy 
of  the  molecules  of  the  other  substance  entering  into  the  reaction,  and  e' 
the  average  energy  of  all  of  them. 

Comparing  equations  (649)  and  (651)  for  the  two  limiting  cases,  we 
see  that  when  the  law  of  photochemical  equivalence  is  followed,  the  tem- 
perature coefficient  depends  only  on  the  energies  of  molecules  of  the  photo- 
active substance,  while  in  the  other  limiting  case  we  have  to  include 
terms  depending  on  the  energies  of  molecules  of  the  other  substance  as 
well  as  the  term  1/2  T.  For  intermediate  cases  we  should  expect  inter- 
mediate results. 

364.  It  is  evident  that  other  mechanisms  of  photochemical  reaction 
besides  the  unimolecular  and  bimolecular  ones  discussed  could  be  theo- 
retically treated  by  similar  methods. 

365.  The  method  of  treating  cases  of  photochemical  steady  states  in 
which  a  thermodynamic  equilibrium  is  displaced  by  illumination  will  also 
be  evident.  To  do  this  we  shall  merely  have  to  equate  the  rates  for  the 
different  reverse  processes  which  are  occurring. 


296  STATISTICAL  MECHANICS  [Ch.  22 

366.  Comparison  of  Theoretical  and  Experimental  Results  as 
to  the  Temperature  Coefficient  o£  Photochemical  Reaction  Rate. — 

Examining  equations  (641),  (649),  and  (651)  for  the  temperature  co- 
efficient of  photochemical  reaction  rate,  we  see  that  if  we  neglect  the 
small  term  l/2r^  they  can  all  be  written  in  the  form 

d  log  k^  _  e  — g  .^c^. 

~dT^-    kT^  ^^^^^ 

where  (s — s)  is  the  difference  between  the  average  energy  of  the  mole- 
cules that  enter  into  the  reaction  and  that  of  all  the  molecules. 

The  equation  has  the  same  form  as  equation  (613)  in  the  last  chapter 

for  the  temperature  coefficient  of  thermal  reactions.  The  quantity  8  is, 
however,  quite  different  iii  the  two  cases,  since  in  the  equation  for  thermal 

reactions  e  was  the  average  energy  of  the  molecules  that  react  as  they 
exist  in  the  activated  state  that  leads  to  reaction,  while  in  equation  (652) 

£  is  the  average  energy  of  the  molecules  that  react  before  they  have 
picked  up  the  radiation  which  leads  to  reaction. 

With  the  help  of  equation  (652),  we  can  now  interpret  existing  data 
on  temperature  coefficients. 

367.  The  most  striking  fact  concerning  the  effect  of  temperature  on 
photochemical  reaction  rates  is  the  extremely  great  number  of  reactions 
which  have  nearly  zero  temperature  coefficient.  Plotnikow  ^^  in  his  collec- 
tion of  existing  information  on  photochemistry  lists  21  photochemical  re- 
actions for  which  the  maximum  value  of  {d  log  k^/dT)  is  approximately 
0.008. 

The  existence  of  such  a  group  of  reactions  is  readily  understood  on 
the  basis  of  equation  (652),  since  in  accordance  with  that  equation  we 
may  write 

dT      -    kT'    -^  ^^^^^ 

and  conclude  that  for  this  great  group  of  well-known  photochemical 
reactions  the  average  energy  of  the  molecules  which  enter  into  the  re- 
action is  practically  the  same  as  the  average  energy  of  all  the  molecules. 
This  result  is  entirely  understandable ;  it  means  that  the  average  mole- 
cule has  as  good  a  chance  of  picking  up  radiant  energy  and  becoming 
activated  as  the  molecules  in  some  special  quantum  state  far  removed 
in  energy  content  from  the  average.  For  reactions  of  this  group,  pre- 
liminary partial  activation  is  not  necessary  in  order  that  a  molecule  of 

"  Plotnikow,    "Lehrbuch    dcr    Photochemie,"    de    Gruyter    and    Co.,    Berlin    and 
Leipzig,  1920,  p.  62. 
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the  photoactive  substance  may  absorb  its  quantum  and  react,  nor  is  pre- 
liminary activation  necessary  for  the  molecules  of  any  other  substance 
involved. 

368.  The  above  discussion  also  makes  it  clear  that  v^^e  might  expect 
to  find  cases  in  which  preliminary  activation  of  the  molecules  w^ould  be 
advantageous  in  promoting  reaction.  This  may  arise  because  molecules 
in  the  lower  quantum  states  are  not  in  a  condition  to  absorb  radiant 
energy  of  the  frequency  used,  or  because  the  energy  level  which  they 
attain  after  the  absorption  is  not  high  enough  to  lead  to  chemical  reaction. 

In  such  cases  the  average  energy  e  of  the  molecules  which  enter  into 
the  reaction  will  be  greater  than  the  average  energy  e  of  all  the  molecules 
and  we  should  find 

^  =  %|>0  (654) 

As  a  matter  of  fact,  Plotnikow  lists  17  reactions  in  which  the  value  of 
{d  log  k,/dT)  varies  from  about  0.016  to  0.040. 

This  too  is  entirely  reasonable  on  the  basis  of  our  picture.  A  value 
of  {d  log  ky/dT)  =  0.040  leads  at  room  temperature  to  a  value  of  about 
7000  calories  per  mol.  This  corresponds  to  a  value  of  0.3  volt  per  mole- 
cule which  is  an  entirely  reasonable  figure  for  the  difference  in  energy 
contents  between  successive  quantum  levels  in  a  molecule, 

A  photochemical  reaction  taking  place  at  room  temperature  in  a 
homogeneous  system  with  such  a  temperature  coefficient  might  be  very 
reasonably  interpreted  by  assuming  that  the  molecules  in  the  lowest 
quantum  state  were  not  photoactive,  and  that  molecules  had  to  be  in  a 
higher  quantum  state  with  a  preliminary  activation  corresponding  to 
0.3  volt  in  order  to  absorb  light  and  react. 

The  process  of  reaction  itself  would  then  consist  in  the  picking  up 
of  a  quantum  hv  by  molecules  in  this  higher  quantum  state  followed  by 
reaction.  If  the  photoactive  light  has  the  wave-length  500  fxfi  this  denotes 
a  further  increase  in  energy  level  corresponding  to  an  activation  of  2.5 
volts  per  molecule,  again  an  entirely  reasonable  figure. 

369.  Before  this  explanation  of  these  small  temperature  coefficients 
could  be  finally  accepted,  however,  it  would  be  necessary  to  show  that 
we  could  still  account  for  the  absolute  magnitude  of  the  rates  if  it  is 
mainly  molecules  in  higher  quantum  states  that  are  involved.  More- 
over, these  small  temperature  coefficients  might  not  be  solely  due  to  the 
above  mechanism,  since,  as  the  experiments  are  actually  carried  out, 
the  velocity  of  diffusion  of  molecules  into  a  strongly  illuminated  zone 
may  be  of  importance.     For  aqueous  solutions  the  viscosity  decreases 
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in  the  neighborhood  of  20  per  cent  per  10°  rise  in  temperature.  If  we 
assume  the  same  proportional  increase  in  reaction  rate  this  would  give 
us  a  value  of  {d  log  kJdT)  of  about  0.018,  which  falls  in  the  region  of 
discussion.  Nevertheless  small  temperature  coefficients  of  the  same  order 
occur  in  gas  reactions  where  the  rate  of  diffusion  would  be  much  less 
affected  by  temperature,  so  that  the  viscosity  explanation  could  not  be  a 
general  one. 

370.    If  we  differentiate  equation  (652)  with  respect  to  the  frequency 
V,  we  obtain 


;(^H=^i 


Since  8  is  the  average  energy  of  the  molecules  that  actually  react,  this 
quantity  may  decrease  with  frequency,  for  at  higher  frequencies  the 
magnitude  of  the  quantum  hv  is  increased  and  molecules  from  a  lower 
quantum  state  may  be  raised  to  a  level  where  reaction  is  possible.  This 
means  that  we  may  expect  to  find  cases  where  the  second  derivative 
in  question  is  negative,  provided  {d  log  ky/dT)  is  not  already  zero. 

Experimental  data  of  the  kind  in  question  are  not  numerous  nor  cer- 
tain. Nevertheless,  referring  once  more  to  Plotnikow's  treatise,  express- 
ing temperature  coefficients  as  the  ratio  between  the  rates  at  tempera- 
tures 10°  apart,  we  find  that  silver  citrate  paper  has  the  temperature 
coefficient  1.19  in  the  blue  and  1.07  in  the  ultraviolet;  the  phototropic 
substance  salicylidenenaphthylamine  has  the  values  1.8  in  the  green,  1.45 
in  the  blue  and  1.39  in  the  violet;  and  the  reaction  between  chlorine 
and  hydrogen  has  the  values  1.50  for  green  light  (550-530  fif-i),  1.31 
for  blue  light  (490-470  ////)  and  1.21  for  violet  light  (400-350  /a/i).  These 
results  if  reliable  are  in  complete  agreement  with  the  theory  presented 
above.^- 

"  For  the  first  presentation  of  the  suggested  theory  of  temperature  coefficients, 
see  Tolman,  Journ.  Amcr.  Chem.  Soc,  45,  2285   (1923). 
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The  Relation  between  Statistical  Mechanics  and 
Thermodynamics 

371.  Introduction. — It  has  been  made  apparent  by  the  foregoing 
chapters  that  statistical  mechanics  is  capable  of  treating  all  the  problems 
ordinarily  handled  by  thermodynamics.  It  is  true  that  the  language  of 
thermodynamics  is  oftentimes  simpler  than  that  of  statistical  mechanics, 
and  its  equations  better  adapted  for  the  substitution  of  empirical  values 
of  the  parameters  than  are  the  equations  of  statistical  mechanics.  For 
this  reason  we  have  not  hesitated  in  Chapters  9  and  13  to  show  the 
bearing  of  some  of  our  results  on  ordinary  thermodynamic  calculations. 
This  was  due,  however,  to  practical  rather  than  theoretical  considerations. 

In  addition  to  the  possibility  of  treating  what  are  usually  considered 
as  thermodynamic  problems  by  statistical  mechanical  methods,  we  have 
also  seen  that  the  latter  science  permits  the  theoretical  prediction  of 
the  values  of  many  quantities  which  thermodynamics  must  take  as 
empirical  parameters,  and  furthermore  permits  the  treatment  of  rates 
of  physical-chemical  change,  a  problem  which  thermodynamics  cannot 
attack.  It  seems  evident  then  that  we  must  regard  the  science  of  statistical 
mechanics  as  the  more  fundamental,  and  investigate  the  possibility  of  a 
direct  deduction  of  the  axioms  of  thermodynamics  from  the  principles 
of  statistical  mechanics  itself.  This  is  the  problem  which  we  shall  treat 
in  the  present  chapter. 

The  most  satisfactory  method  of  developing  the  relations  between 
thermodynamics  and  statistical  mechanics  is  not  as  yet  a  matter  of  uni- 
versal agreement.  Hence  we  shall  content  ourselves  in  the  present  chapter 
with  a  partially  complete  account  of  some  of  the  different  modes  of 
attack  that  have  been  undertaken.  The  discussion,  however,  will  be 
sufficient  to  show  interrelations  between  different  proposals,  and  to  pro- 
vide a  basis  for  the  treatment  of  certain  important  physical-chemical 
problems  where  it  is  advantageous  to  commence  the  discussion  on  a 
statistical  mechanical  basis,  and  then  translate  the  results  into  thermo- 
dynamic language  so  that  the  discussion  can  be  completed  by  thermo- 
dynamic methods. 

299 
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372.  The  Two  Laws  of  Thermodynamics. — The  whole  of  classical 
thermodynamics  may  be  regarded  as  obtainable  from  two  axioms  known 
as  the  first  and  second  laws. 

The  first  lazi^  may  be  written  in  the  form 

dE  =  dQ~  dW  (656) 

and  states  that  the  energy  increase  in  a  macroscopic  system  is  equal  to 
the  heat  which  flows  into  it  less  the  work  which  it  does  on  its  surroundings. 

The  above  equation  may  be  regarded  as  an  immediate  consequence 
of  atomistics,  provided  we  merely  interpret  the  heat  given  to  the  system 
as  energy  transferred  by  random  molecular  processes,  and  assume  that 
the  law  of  the  conservation  of  energy  applies  either  directly  to  individual 
molecular  processes  or  at  the  worst  applies  to  the  total  result  of  their 
overall  behaviour. 

To  state  the  second  law,  we  may  define  the  infinitesimal  increase 
in  the  entropy  of  a  system  dS,  accompanying  any  reversible  change  in 
state  in  terms  of  temperature  T  and  heat  absorbed,  by  the  equation 

dS  =  ^  (657) 

The  second  law  of  thermodynamics  now  requires  that  — ^  shall  be  a 

perfect  differential  for  any  reversible  path,  so  that  the  entropy  6^  may  be 
regarded  as  a  function  of  the  state  for  any  system  which  is  in  equilibrium. 
The  law  further  requires  that  T  shall  be  a  quantity  such  that  heat 
will  only  flow  from  systems  of  higher  to  those  of  lower  temperature, 
and  that  for  actual  nonreversible  changes  in  the  state  of  a  system,  in 
which  the  external  parameters  of  the  system  are  varied,  dQ  being  kept 
zero,  the  entropy  S  shall  increase,  instead  of  remaining  constant  in 
accordance  with  equation  (657),  which  applies  only  to  reversible  processes. 
Equations  (656)  and  (657)  may  now  be  combined  to  advantage  in  the 
form 

dE  =  TdS  —  A^da,  —  Aodaz  —  . .  (658) 

where  the  external  work  done  by  the  system  on  its  surroundings  has  been 
expressed  in  terms  of  the  external  forces  acting  on  the  system  ^j.  An, .... 
and  the  corresponding  displacements  da-^,  rfag 

373.  Evidently  to  obtain  a  statistical  mechanical  interpretation  of 
thermodynamics  we  must  secure  statistical  mechanical  analogues  for  the 
quantities  occurring  in  equation  (658),  which  will  obey  an  equation  of 
the  same  form  and  will  otherwise  have  properties  similar  to  those  of  the 
thermodynamic   quantities.     The   fact   that   more   than   a  single    set   of 
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pretty  satisfactory  analogues  have  been  found  explains  the  possibility  of  a 
variety  of  modes  of  treating  the  problem. 

We  shall  first  describe  in  outline  the  three  proposals  of  Gibbs  in 
v^^hich  analogies  v^ere  shov^^n  betv^een  the  quantities  in  equation  (658) 
and  quantities  which  describe  the  properties  of  canonical  and  micro- 
canonical  ensembles  of  systems.  We  shall  then  discuss  in  somewhat  more 
detail  the  results  that  can  be  obtained  by  the  more  direct  method  of 
Boltzmann  in  considering  those  statistical  mechanical  properties  of  a 
single  system  which  are  analogous  to  the  quantities  in  equation  (658). 

374.  The  First  Analogue  of  Gibbs. — The  first  analogue  of  Gibbs 
was  based  on  the  consideration  of  a  canonical  ensemble  of  systems  of 
structure  identical  with  that  of  the  system  of  interest  and  distributed  in 
the  y-space  with  the  density 

Q  =  Ne  "e—  (659) 

where  N  is  the  total  number  of  systems  in  the  ensemble,  E  the  energy 
of  a  single  system  and  W  and  &  are  constants,  having  the  dimensions  of 
energy,  the  latter  being  called  the  modulus  of  distribution. 

With  this  distribution  in  the  /-space  the  number  of  systems  having 
coordinates  and  momenta  in  any  region   dQ^  ....  dPm  will  evidently  be 

dN  =  Ne~Q^  dQ^....  dP^  (660) 

and  hence  by  integration  over  the  whole  phase  space,  we  must  have  the 
result 

j'....Je~e—dQi...dPm=l  (661) 

Furthermore  for  the  average  of  any  property  of  the  systems  P  we 
shall  evidently  have 

P  z=J. . .  .Je^^PdQ,  ...dPm  (662) 

375.  It  should  also  be  noted  that  for  such  a  canonical  ensemble  of 
systems,  having  many  degrees  of  freedom,  most  of  the  representative 
points  will  be  in  regions  of  the  phase  space  where  the  energy  E  does  not 
differ  greatly  from  its  most  probable  value.  This  point  has  been  especially 
analyzed  by  Gibbs,  but  it  will  be  seen  qualitatively  from  the  form  of 
equation  (660),  that  very  large  values  of  E  will  not  be  frequent  because 
of  the  effect  of  the  exponential  factor  and  very  small  values  of  E  will 
not  be  frequent  owing  to  the  small  region  in  the  phase  space  corre- 
sponding to  low  energies.  It  was  hence  possible  for  Gibbs  to  show  that 
the  great  majority  of  the  systems  in  the  ensemble  have  a  value  of  E  which 
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does  not  differ  sensibly  from  the  average  value  E,  and  this  may  be  ex- 
tended to  the  relation  betv^een  other  properties  of  the  systems  and  their 
average  values. 

376.  We  are  now  ready  to  develop  the  first  analogy  of  Gibbs.  Let 
us  consider  the  modulus  of  distribution  0  as  a  variable  parameter,  so 
that  by  changing  its  value  y^t  can  obtain  a  variety  of  different  canonical 
ensembles,  and  let  us  also  consider  that  the  energy  E  is  dependent  on 
certain  external  parameters  %  .  .  .  .  a„,  of  the  nature  of  volume,  height, 
or  other  macroscopic  quantities  of  an  ordinary  mechanical  nature.  Let 
us  now  make  an  infinitesimal  (reversible)  change  in  the  values  of  the 
parameters  @,  ax .  . .  .  On-  This  will  give  us  a  new  canonical  distribution, 
and  since  equation  (661)  must  then  still  hold,  we  may  write 


j...j:-^[^iw-^_de  +  §,de 


Introducing  the  principles  of  averaging  given  by  equation  (662)  and 
noting  that  the  external  forces  corresponding  to  the  parameters  a^  . .  .  .  «„ 
will  be  given  by  the  equations 

-f  =  ^' '-Il  =  ^"  (^*) 

this  may  be  rewritten  in  the  form 


dW 
0 


—  ^2d&+§^d0-\-^i  A.dax  +  ....  +  Andanl  =  0 


(665) 


Let  us  now  define  a  quantity  S,  called  by  Gibbs  the  average  value  of  the 
negative  index  of  probability,  by  the  equation 


givmg  us 


-      E V 

S  =  —~  (666) 


dS  =  ^-^~§de  +  ^,d0  (657) 


Combining  (665)  and  (667)  and  transposing  we  may  then  write 

dE  =  0dS  —  A,da,  —  Aoda^  — (668) 

We  have  thus  obtained  an  equation  of  exactly  the  same  form  as  the 
thermodynamic  equation   (658),  in  which  the  energy  E,  temperature  T, 
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entropy  S,  and  external  forces  A.^,  A^.  .  .  .  oi  2i  single  system,  are  replaced 
by  the  average  energy  E,  modulus  of  distribution  0,  average  negative 
index  of  probability  S,  and  average  values  of  the  external  forces 
A^,  Ar,....  for  all  the  systems  in  the  ensemble. 

To  complete  the  analogy,  it  was  shown  by  Gibbs  that  if  the  sys- 
tems of  two  ensembles  are  brought  into  a  condition  analogous  to  thermal 
contact,  the  average  result  is  a  passage  of  energy  from  the  ensemble  of 
the  greater  modulus  to  that  of  the  less.  And  it  was  further  shown  that 
if  the  external  parameters  for  the  systems  of  a  canonical  ensemble  are 
subjected  to  abrupt  changes  (i.e.  nonreversible  changes),  the  ultimate 
result  after  sufficient  lapse  of  time  will  be  a  new  canonical  ensemble 
with  a  greater  value  of  S} 

We  have  thus  obtained  a  fairly  complete  correspondence  between 
the  behavior  of  certain  average  quantities  in  an  ensemble  of  systems 
and  the  behavior  of  thermodynamic  quantities  for  a  single  system.  And 
since  we  have  seen  above  that  these  average  quantities  do  not  differ 
sensibly  from  the  individual  quantities  which  we  may  expect  to  encounter 
if  we  pick  systems  at  random  from  the  ensemble,  we  may  regard  the 
analogy  as  furnishing  a  means  of  predicting  the  behavior  of  single  systems. 
It  must  be  remarked,  however,  that  the  method  seems  formal  and  abstract. 

377.  The  Second  Analogue  of  Gibbs, — The  second  analogue  of 
Gibbs  is  based  on  the  consideration  of  a  micro  canonical  ensemble  of  sys- 
tems of  structure  identical  with  that  of  interest  all  having  within  the 
narrow  limit  dE  the  same  energy  E  as  that  of  the  system  of  interest. 
Defining  V  now  as  the  total  volume  of  the  phase  space  corresponding 
to  states  of  less  energy  than  E,  Gibbs  was  able  to  obtain  the  equation 

dE  =  j^^  d\ogV  —  A^\da^  —  A.\dao  —  ....  (669) 

where  (dE/d  log  V)  is  now  the  analogue  of  temperature,  log  V  of 
entropy,  and  A^\,  ^a],  etc.,  are  the  average  values  for  the  external  forces 
of  all  the  systems  in  the  microcanonical  ensemble.  It  was  shown,  however, 
by  Gibbs  that  although  the  quantity  (dE/d  log  V)  had  some  of  the  prop- 
erties of  temperature,  it  was  not  true  in  general  that  the  value  of  this 
quantity  would  remain  constant  when  two  systems,  having  the  same  value 
of  this  quantity,  are  combined  to  form  a  third  system. 

^  It  must  be  remarked  that  it  seems  doubtful  if  the  rate  of  approach  to  the 
new  increased  value  of  6"  could  be  shown  to  be  fast  enough  without  introducing 
further  hypothesis  than  those  of  Gibbs.  See  P.  and  T.  Ehrenfest,  Encyklop.  math. 
Wiss.,  IV  2,  II,  Heft  6,  page  61. 
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378.  The  Third  Analogue  of  Gibbs. — The  third  analogue  of  Gibbs 
was  also  based  on  a  consideration  of  a  niicrocanonical  ensemble.  Defining 
a  new  quantity  4>,  by  the  relation 

<^  =  log  ^  (670) 

he  obtained  the  equation 

dE  =  ^d4>  —  A^da^  —  Aoda^  —  ....  (671 ) 

d(f> 

where  (dE/dcj))  is  now  the  analogue  of  temperature,  <j)  of  entropy,  and 

A^,  A^. .  . .  the  most  probable  values  of  the  external  forces  on  the  sys- 
tems of  the  ensembles. 

Similar  difficulties  as  to  the  use  of  the  quantity  (dE/d<f>)  for  tem- 
perature were  found  as  in  the  preceding  analogue. 

Gibbs  apparently  considered  both  these  last  two  analogues  as  faulty, 
and  regarded  the  canonical  ensemble  as  superior  to  the  niicrocanonical 
for  the  representation  of  thermodynamic  properties. 

379.  Boltzmann's  Relation  between  Entropy  and  Probability. — 

We  may  now  turn  from  the  attempt  to  represent  thermodynamic  quanti- 
ties by  the  properties  of  an  ensemble  of  systems,  and  proceed  to  the 
possibility,  initiated  by  Boltzmann,  of  relating  thermodynamic  quantities 
directly  to  mechanical  quantities,  which  must  indeed  be  studied  by  statistical 
mechanical  methods,  but  nevertheless  themselves  express  the  properties  of 
a  single  system. 

380.  Of  the  three  thermodynamic  quantities  S,  Q  and  T,  it  is  evi- 
dent that  the  entropy  5"  is  the  only  one  which  will  need  elaborate  study. 
The  heat  Q  absorbed  by  a  single  system  can  be  immediately  identified 
on  the  statistical  mechanical  basis  with  the  energy  which  the  system  re- 
ceives by  molecular  impacts  or  other  elementary  disordered  processes 
which  defy  ordinary  observational  methods.  And  the  temperature  T 
may  be  regarded  as  defined  by  the  pressure  of  a  sufficiently  dilute  gas 
placed  in  contact  with  the  system  of  interest,  using  the  methods  of 
Chapter  4,  which  permitted  us  to  introduce  the  temperature  immediately 
into  statistical  mechanical  equations. 

Hence  we  may  turn  at  once  to  entropy,  and  consider  the  relation 
between  the  entropy  of  a  system  and  the  probability  of  its  state  which 
was  obtained  by  Boltzmann  as  the  result  of  the  following  somewhat  for- 
mal reasoning. 

381.  From  a  thermodynamic  point  of  view,  the  entropy  of  an  isolated 
system  can  only  increase  if  the  maximum  value  has  not  been  reached.     On 
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the  other  hand,  from  a  statistical  mechanical  point  of  view,  isolated  sys- 
tems may  be  expected  to  change  from  states  of  lesser  probability  to 
those  of  greater  proliability.  Hence  it  would  seem  natural  to  regard 
the  entropy  6*  of  a  system  as  some  function  of  the  probability  \V  of 
its  state.     We  may  express  this  fact  by  the  functional  equation 

S  =  <i>{W)  (672) 

The  properties  of  entropy  and  probability,  however,  permit  an  im- 
mediate solution  of  the  form  of  this  functional  equation.  If  we  consider 
two  separate  systems  of  entropies  S-^^  and  So  and  probabilities  W-^  and 
Wo,  we  may  write 

S,  =  <t>{]]\) 

(673) 

s,:=4>{iv.;) 

and  since  entropies  are  additive  and  independent  probabilities  are  multipli- 
cative we  must  have  for  the  combined  system 

S,^S.  =  4>{W,W.)  {67  A) 

The  only  solution,  however,  for  equations  (673)  and  (674)  is  of  the  form 

S  =  k\og  W  -\-a  (675) 

where  k  and  a  are  constants.  Of  these  the  constant  k  merely  depends 
on  the  interrelation  between  the  scales  of  measurement  for  5"  and  IV, 
and  it  is  best  to  take  it  the  same  as  the  k  already  introduced.  The  rela- 
tion between  a  and  the  third  law  will  be  discussed  later. 

The  above  is  Boltzmann's  famous  equation  which  serves  as  a  bridge 
between  thermodynamic  and  statistical  mechanical  modes  of  reasoning. 

If  we  consider  the  change  in  entropy  accompanying  a  change  in  the 
state  of  a  single  system  we  may  rewrite  equation   (675)    in  the  form 

AS  =  k\og^p^  (676) 

382.  Entropy  Increase  and  Boltzmann's  //-Theorem. — The  rela- 
tion of  entropy  to  Boltzmann's  quantity  H  is  now  immediately  evident. 
In  deriving  the  Maxwell-Boltzmann  distribution  law,  we  found  that  the 
y-weight  of  a  given  state  of  gas  consisting  of  A''  molecules  was  given  by 
the  expression  (Chapter  4,  equation  (38)) 

\ogy  =  N  log  A^  —  S.Vi  log  A^i  -f  log  do  (677) 

i 

where  A'i  is  the  number  of  molecules  in  the  fth  region  of  volume  do 
into  which  we  have  divided  the  //-space  for  the  molecules.  Hence  since 
probability  and  /-weight  are  proportional  we  may  write 
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S  =  —  k^Ni\ogNi^  const.  (678) 

t 

In  Chapter  18,  however,  Boltzmann's  H  was  defined  by  equation   (402), 
as  having  the  value 

H  =  ^Ni  log  Ni  (679) 


So  that  we  can  write 


S  =  —  kH-\-  const.  (680) 


The  tendency  for  entropy  and  probability  to  increase  to  a  maximum 
and  for  Boltzmann's  H  to  decrease  to  a  minimMm  are  thus  all  seen  to  be 
different  expressions  of  the  same  principle. 

383.  Different  Expressions  for  Entropy  of  Systems  at  Equi- 
librium.— It  will  be  noticed  from  the  foregoing  that  Boltzmann's  treat- 
ment successfully  extends  the  notion  of  entropy  to  systems  which  are  not 
in  equilibrium,  an  extension  which  could  not  be  made  by  the  Gibbs' 
method,  and  one  which  meets  certain  difficulties  in  the  field  of  pure 
thermodynamics. 

In  the  case  of  systems  which  are  in  a  state  of  equilibrium,  several 
different  expressions  (which  are  identical  for  a  system  with  a  large 
number  of  degrees  of  freedom),  can  be  used  for  the  Boltzmann  result. 

Starting  with  the  Boltzmann  equation 

S  =  k\oglV  -\-a 

since  probability  is  proportional  to  /-weight  we  can  write 

6"  =  ib  log  7  +  const.  (681) 

where  y  is  the  volume  in  the  phase  space  which  corresponds  to  the  state 
of  interest.  For  a  system  which  has  a  large  number  of  degrees  of  free- 
dom, we  found,  however,  in  Chapter  4,  that  practically  the  whole  volume 
of  the  /-space  between  the  limits  E  and  E  -{-  dE  available  to  the  micro- 
canonical  ensemble,  corresponded  to  states  of  the  system  which  did  not 
differ  sensibly  from  the  Maxwell-Boltzmann  state  of  maximum  prob- 
ability. Hence  without  sensible  difference  we  can  replace  equation  (681) 
for  a  system  which  is  in  its  equilibrium  state  by 

Sz=:k\og~J^  + const  (682) 

where  dV/dE  is  the  change  in  volume  of  the  phase  space  with  energy  at 
the  energy  value  of  interest. 

With  a  system  of  many  degrees  of  freedom,  however,  a  still  fur- 
ther change  is  possible.    The  volume  V  will  be  that  of  a  many  dimensional 
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solid  and  the  logarithm  of  the  surface  dV/dE  will  differ  by  an  in- 
appreciable amount  from  the  logarithm  of  the  whole  volume  V.  This 
can  be  seen  for  example  from  a  consideration  of  the  expressions  for 
many  dimensional  spheres  and  ellipsoids  given  in  Sections  158  and  159 
of  Chapter  12.  Hence  again  without  sensible  difference  we  can  take  for  a 
system  whicJi  is  in  its  equilibrium  state 

S  =  k  log  V  +  const.  (683) 

Equations  (682)  and  (683)  give  the  same  expressions  for  entropy 
as  those  in  the  third  and  second  analogues  of  Gibbs.  The  present 
method  has  the  advantage,  however,  that  we  do  not  now  need  to  find 
properties  of  the  ivJiolc  microcanonical  ensemble  which  are  analogous  to 
temperature  and  external  force. 

384.    Application  of  the  Boltzmann  Method  to  a  Specific  Case. — 

It  will  be  desirable  to  apply  the  Boltzmann  method  to  a  specific  simple 
case.^  This  will  have  the  advantage  of  showing  that  the  Boltzmann 
equation  for  entropy  gives  an  expression  which  does  have  the  properties 
of  the  thermodynamic  quantity  S,  and  will  also  illustrate  the  procedure 
that  can  be  undertaken,  when  statistical  mechanical  conclusions  are  for 
convenience  to  be  translated  into  thermodynamic  language,  for  the  treat- 
ment of  actual  physical  chemical  problems. 

Consider  a  system  of  total  energy  E,  composed  of  N  molecules  of  a 
single  kind,  each  with  n  degrees  of  freedom  which  may  be  regarded  as 
fully  excited,  contained  in  a  vessel  of  volume  V. 

For  the  number  of  molecules  at  equilibrium  in  any  region  of  the  ^u-space 

do  =  dq,....  dpn  (684) 

we  may  write  in  accordance  with  equations  (73)  and  (74)  in  Chapter  4, 
the  Maxwell-Boltzmann  expression  in  the  form  ^ 

dN  =  eTr-do  (685) 

where  the  parameter  ip  is  defined  by  the  equation 

e  kT  =  -, ^ (686) 

I  e~"iyl'  do 

the  integration  being  taken  over  all  regions  of  the  /<-space. 

'See  Tolman,  Journ.  Anier.  Cheni.  Soc,  44,  75  (1922). 

'  The  quantities  i/-  and  e  occurring  in  this  expression  for  the  distribution  of 
molecules  in  a  system  should  be  carefully  distinguished  from  the  -I'  and  E  in  the 
Gibbs'  expression  for  the  distribution  of  systems  in  a  canonical  ensemble. 
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For  the  average   value   of   any  property  P   of   the   molecules,   which 
depends  on  their  location  in  the  /ti-space,  we  may  evidently  write 


NP  = 


/^  —  e 
e^PTPdo  (687) 


where  the  integration  is  to  be  taken  over  the  whole  of  the  //-space,  if  we 
desire  the  average  value  for  all  the  molecules  of  the  system. 

385.  Before  proceeding  let  us  call  attention  to  the  nature  of  the 
quantities  occurring  in  the  Maxwell-Boltzmann  equation.  In  accord- 
ance with  the  original  way  in  which  e  entered  the  derivation  of  the 
Maxwell-Boltzmann  equation  in  Chapter  4,  Section  39,  the  quantity  Si 
would  be  the  rate  of  change  in  the  energy  of  the  system  per  molecule 
that  is  added  to  the  i'th  region  do,  zvlicn  the  system  is  in  its  state  of 
niuxiiJiuin  probability  with  energy  E,  volume  V  and  total  number  of  moie- 
cules  N,  in  accordance  with  the  equation 

e,  =  §^  (688) 

Hence,  except  in  the  case  of  perfect  gases,  e  may  be  not  only  a  func- 
tion of  the  region  in  the  //-space  do,  but  may  also  depend  on  the  parameters 
A''^  E  and  V  which  determine  the  whole  state  of  the  system,  since  the 
characteristics  of  the  most  probable  state  will  depend  on  those  quantities. 
As  to  the  quantities  t/'  and  T  in  the  Maxwell-Boltzmann  expression, 
it  is  evident  that  they  are  functions  of  the  whole  state  of  the  system 
as  given  by  the  parameters  N,  E  and  V. 

386.  It  will  also  be  advantageous  before  proceeding  to  obtain  a 
mathematical  identity  which  will  later  prove  useful.  In  accordance  with 
the  distribution  law  we  may  write  for  the  total  number  of  molecules 


N 


/yp  —  e 
-PT  do  (689) 

Dififerentiating  with  respect  to  the  parameter  E,  keeping  the  other  two 
parameters  V,  and  A^  constant,*  we  obtain 

(I) ,..  =  J4  (  -  )  -  =/- M  (^)  -  =  0  (-) 

where  the  expression  is  equal  to  zero,  since  A^  is  constant  by  hypothesis. 

*  It  should  be  noted  that  the  constancy  of  V  makes  it  possible  to  differentiate 
solely  back  of  the  integral  sign,  since  under  these  circumstances  the  limits  of  integra- 
tion are  constant. 
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387.    We  are  now  ready  to  consider   the   Boltzmann   expression    for 
entropy. 

S  =  k  log  W  +  a 

In  accordance  with  equation  (38)  in  Chapter  4,  we  may  write  for  the 
y-weight  of  any  state  of  the  system 

log  y  =  N  log  N  —  ^Ni  log  A^i  +  log  da  (691 ) 

t 

where  Ni  is  the  number  of  molecules  falling  in  the  i'th  region  of  the 
//-space.  Considering  now  the  state  of  equilibrium  or  maximum  prob- 
ability, we  may  substitute  the  Maxwell-Boltzmann  expression  for  A^^,  and 
remembering  that  probability  is  proportional  to  y-weight,  write  in  accord- 
ance with  the  Boltzmann  expression  for  entropy 


e   kT    log  I  c   kT    do  \  do  -\-  const. 


(692) 


where  for  convenience  we  have  changed  to  the  language  of  the  inteoral 
calculus.  In  accordance  with  equation  (687)  for  averages  we  may  now 
write 

S  =  k  log  PV  ^  const.  = = — ^  4-  const.  (693) 

as  the  Boltzmann  expression  for  entropy. 

It  will  be  seen  from  the  development  that  the  constant  in  this  equa- 
tion can  depend  on  the  number  of  molecules  A^  and  on  the  size  chosen 
for  do,  as  well  as  on  the  original  undetermined  constant  a  in  the  Boltz- 
mann equation,  which  could  itself  be  a  function  of  A^.  If  we  drop  the 
constant  in  (693)  we  shall  later  find  that  the  correct  dependence  of  vS' 
on  A''  is  obtained,  and  since  we  are  not  now  interested  in  absolute  values 
of  entropy,  it  will  be  most  convenient  to  do  this  and  write 

„       A'e  —  Nw 

^  = f — -  (694) 

388.  We  may  now  investigate  to  see  if  this  equation  for  entropy 
obtained  by  the  Boltzmann  method  really  does  have  the  known  properties 
of  the  thermodynamic  quantity  5". 

Except  for  the  additive  constant,  in  which  we  are  not  now  interested, 
the  entropy  of  a  homogeneous  system  can  be  completely  defined  by  the 
thermodynamic  equations 
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1 


(695) 
^  (696) 


ns\ 

\dE  )r,N 
/6S\ 

The  first  two  of  these  equations  are  direct  consequences  of  the  funda- 
mental equation 

iS='^I^  (698) 

and  the  third  equation  results  from  the  fact  that  the  entropy  of  a  system 
is  always  taken  as  the  sum  of  the  entropies  of  its  parts,  and  hence  for  a 
homogeneous  system,  which  has  the  same  pressure  and  temperature 
throughout,  the  entropy  will  be  proportional  to  the  number  of  molecules. 
Let  us  now  show  that  the  quantity  given  by  equation  (694)  actually 
does  satisfy  these  thermodynamic  equations. 

389.    We    may    first    investigate    the    derivative    with    respect    to    E. 
Noting  equation  (687)  for  the  average  value  of  a  quantity,  we  may  write 

(d    Ne  —  Nip  \  d     r    ^-^  s  —  w    , 

Remembering  that  ip  and  T  are  independent  of  o,  this  may  be  rewritten 
in  the  form 

In  accordance  with  equation   (690)    it  is  evident,  however,  that  the  last 

two  terms  in  this  equation  are  zero.     As  to  the  remaining  term  on  the 

right-hand   side   of   the   above   equation,   it    is   evident  that   the   quantity 

d    /    £=zl\ 

t-ftI  e    I'T    fdo  is  the  rate  of  change  in  the  number  of  molecules  m  region 

do  with  change  in  E,  while  £  is  the  rate  in  change  of  the  energy  per 
molecule  added  to  the  region  do.  Hence  the  value  of  the  definite  integral 
is  evidently  dE/dE  =  1.     Our  expression  thus  reduces  to 
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/  d    Ne  —  Nw  \  1 

{■dE—T^),,,=  T  (699) 

thus  agreeing  with  the  first  of  the  three  necessary  equations  (695),  (696) 
and  (697). 

390.    Let  us  now  show  that  the  rate  of  change  of  —  with 

respect  to  volume  has  the  required  value.  In  our  previous  discussions 
we  have  made  use  of  the  principle  that  an  isolated  system,  with  a  definite 
energy  content  £  in  a  container  of  definite  volume  V,  will  assume  the 
state  of  maximum  probability.  For  our  present  purposes,  we  shall  make 
use  of  the  equally  valid  principle,  that  the  system  will  assume  the  state 
of  maximum  probability  when  it  is  enclosed  in  a  cylinder  with  walls 
which  do  not  permit  the  transfer  of  energy  and  provided  with  a  movable 
piston,  thus  allowing  a  simultaneous  variation  in  the  parameters  E  and  V, 
in  accordance  with  the  equation 

dE  =  —  PdV  (700) 

Now  we  have  already  found,  in  accordance  with  equation  (693), 
that  when  the  parameters  E  and  V  are  not  allowed  to  vary,  the  prob- 
ability IV  of  the  state  finally  attained,  is  given  by  the  expression 

log  W  =  ^^^j!^'^'  +  const.  (701 ) 

The  variation  in  this  quantity  with  the  parameters  E  and  V  will  evi- 
dently be 

S.,rV=  (^%^),.,,  ^^+  (^%^),,.  ^^  (702) 

In  the  special  case  that  we  are  considering,  IV  has  the  maximum  pos- 
sible value  not  only  for  changes  in  internal  arrangement  with  constant 
values  of  E  and  V  but  also  for  variations  in  E  and  V  which  do  not 
conflict  with  the  restriction  imposed  by  equation  (700).  Hence  by  in- 
troducing equation  (700)  and  applying  the  condition  for  maximum  prob- 
ability, we  obtain 

Multiplying  by  k,  dividing  by  dl\  and  introducing  the  expression  we  have 
already  found  for  the  rate  of  change  of    ^^ — -    with  E,  we  obtain 
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/  d  N~e-~Nip\  P  ,„^^, 

^    \W-^^),,,=T  ^'^'^ 

and  have  thus  shown  the  desired  agreement  with  the  second  of  our  three 
necessary  equations  (695),  (696)  and  (697). 

391.  In  order  to  show  the  agreement  of  the  quantity  that  we  have 
chosen  for  entropy  with  the  third  of  the  three  equations,  we  may  first 
obtain  an  auxiliary  equation  by  differentiating  the  expression  for  the 
total  number  of  molecules 

/■A  — e 
e  kT    do 

with  respect  to  the  number  of  molecules  A^,  holding  the  pressure  P  and 
temperature  T  constant.  In  carrying  out  such  a  differentiation,  it  is  to 
be  noted  that  we  cannot  consider  the  limits  of  the  definite  integral  as 
constant,  since  the  volume  F  of  a  homogeneous  system  at  a  given  pres- 
sure and  temperature  is  evidently  proportional  to  the  number  of  mole- 
cules A^,  and  the  volume  over  which  the  generalized  space  do  is  to  be 
integrated  is  proportional  to  V.  Keeping  this  consideration  in  mind,  we 
may  write 

/dN\  Cd    {    t^=i±\    ,     ,    /    ^^\        6»oii„, 


lim. 


dN 


Noting  further  that  the  last  term  in  this  equation  is  evidently  itself  equal 
to  dN/dN,  and  that  de/dN  is  evidently  zero  since  we  have  in  no  way 
changed  the  condition  inside  of  any  given  region  do,  we  obtain 


/ 


or  remembering  that  ij/  and  its  average  value  for  the  system  are  the  same, 
we  may  write 

=  0  (704) 


\dNjp, 


We  may  now  differentiate  our  expression  for  entropy  with  respect  to 
A^,  and  obtain 


/  d    Ne  —  Nyj\ 
\dN         T        / 


e  —  y>        N  de         N  dy) 

P_T  ^~f       ^TW~~  TdN 


In  accordance  with  the  foregoing,  however,  the  last  two  terms  of  this 
equation  are  zero.  Hence  we  may  rewrite  the  above  equation  in  the 
form 
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/  d    Nl-Nxp\         _  1  (Ne-Nxp\ 

\dN         T        )p,r~N\        T        )  ^     ^ 

and  have  thus  shown  the  agreement  of  the  expression  which  we  have 
chosen  for  entropy  with  the  third  and  last  of  the  three  necessary  equa- 
tions (695),  (696)  and  (697). 

The  foregoing,  unfortunately  but  necessarily  lengthy  investigation 
illustrates  by  a  special  example  the  fact  that  the  Boltzmann  equation 
does  give  a  correct  expression  for  entropy,  and  thus  permits  the  transla- 
tion of  statistical  mechanical  results  into  thermodynamic  language.  If, 
for  illustration,  we  should  desire  the  value  of  the  Helmholtz  free  energy 
of  the  system,  we  could  write 

F  =  E  —  TS  =  E—Ne-]-Nxp  (706) 

or  in  accordance  with  equation   (686) 

F  =  E  —  Nl-\-  NkT  log  N  —  NkT  log  f^?  ~Fr  do  (707) 

and  for  a  case  such  as  a  perfect  gas,  in  which  the  difference  between 
the  first  two  terms  could  be  neglected,  this  would  reduce  to 

F  =  NkT\ogN  —  NkT\ogfe~lfdo  (708) 

The  extension  to  cases  of  more  than  one  component  and  to  quantized 
degrees  of  freedom  will  be  evident.  It  should  be  especially  noted  that  a 
clear  idea  as  to  the  meaning  of  s,  as  the  rate  of  change  of  energy  per 
molecule  introduced  into  the  region  do,  will  facilitate  the  application  to 
concentrated  systems.  Only  for  dilute  systems  is  e  the  energy  of  an 
individual  molecule. 

392.  Relation  betv\reen  Statistical  Mechanics  and  the  Third  Law 
o£  Thermodynamics. — In  conclusion  we  must  consider  the  relation  be- 
tween statistical  mechanics  and  the  so-called  third  law  of  thermodynamics. 

The  classical  thermodynamics  was  only  concerned  with  the  changes  in 
energy  AE  and  changes  in  entropy  AS  which  accompany  changes  in  the 
state  of  a  system,  but  did  not  consider  the  absolute  values  of  these  quan- 
tities. Since  the  time  of  the  origin  of  the  classical  thermodynamics, 
the  development  of  the  theory  of  relativity  has  furnished  us  with  a  rela- 
tion between  mass  and  energy,  E  =  mc^,  which  gives  us  a  rational  start- 
ing point  for  energy  values.  The  third  law  of  thermodynamics  endeavors 
in  a  similar  way  to  furnish  a  rational  starting  point  for  entropy  values. 

393.  The  safest  method  to  attack  this  problem  is  to  use  the  methods 
of   Chapter   13,   where   we   showed   from  a  consideration   of   the  vapor 
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pressures  of  crystals,  that  a  system  of  crystalline  substances  at  the  abso- 
lute zero  must  have  the  same  entropy  in  any  form  of  chemical  combina- 
tion. This  makes  it  possible  to  take  the  entropy  as  zero  for  all  crystalline 
substances  at  the  absolute  zero  of  temperature. 

394.  Another  mode  of  attack  which  has  been  largely  carried  out 
under  the  influence  of  the  work  of  Planck  is  to  return  to  the  Boltzmann 
equation  connecting  entropy  and  probability 

S  =  k\ogW-\-a 

and  by  dropping  the  undetermined  constant  a  secure  a  relation 

S  =  k\ogW  (709) 

which  will  give  entropy  an  absolute  value  provided  the  probability  has 
such.^ 

In  the  classical  mechanics,  of  course,  no  absolute  value  could  have 
been  assigned  to  the  probability  IV,  since  the  numerical  value  would 
obviously  depend  on  the  size  do  chosen  for  the  regions  of  equal  probability 
in  the  phase  space.  With  the  advent  of  quantum  theory,  however,  we 
might  expect  an  absolute  value  of  JV  owing  to  the  existence  of  a  finite 
number  of  discrete  quantum  states  of  the  system.  This  is  the  essence 
of  the  dependence  of  the  third  law  on  quantum  theory. 

In  order  to  avoid  all  difficulties  as  to  the  designation  of  equally 
probable  conditions,  Planck's  latest  proposal  is  to  define  entropy  by  the 
equation 

S=klogP  (710) 

where  P  is  the  number  of  all  the  stationary  states  which  the  system 
can  assume  without  overstepping  the  energy  E  which  it  has.*^  In  the 
investigation  in  which  this  definition  is  considered,  it  is  shown  that  the 
equation  leads  in  known  cases  to  the  familiar  expressions  for  entropy, 
and  indications  as  to  the  importance  of  the  definition  for  the  problem  of 
gas  degeneration  are  presented. 

This  must  conclude  our  somewhat  brief  treatment  of  the  relation  be- 
tween statistical  mechanics  and  thermodynamics.  It  is  hoped,  however, 
that  sufficient  has  been  presented  to  make  the  application  to  actual  prob- 
lems possible. 

^  For  an  analysis  of  this  equation,  see  Schrodinger,  Berlin  Acad,  of  Science, 
p.  434  (1925). 

*  Planck,  Zeitschr.  fur  Phys.,  35,  155  (1925)  ;  compare  equations  (710)  and  (683). 

In  obtaining  the  number  of  states  P,  states  which  differ  solely  by  the  interchange 
of  identical  atoms  or  other  elements  are  not  to  be  counted  separately.  This  secures 
the  proper  dependence  of  5"  on  A^. 


Chapter  24 
Concluding  Remarks 

395.  Introduction.' — In  conclusion  we  shall  make  brief  mention  of 
several  important  fields,  susceptible  to  statistical  mechanical  treatment, 
which  unfortunately  cannot  be  fully  treated  in  the  present  book.  We  shall 
also  try  to  indicate  something  as  to  the  future  outlook  for  statistical 
mechanical  investigations. 

396.  Effect  of  External  Fields  of  Force  on  Systems  of  Mole- 
cules.— In  the  applications  to  physical-chemical  systems  which  we  have 
hitherto  considered,  we  have  assumed  in  general  the  absence  of  any  ex- 
ternal field  of  force.  The  methods  of  statistical  mechanics,  particularly 
those  furnished  by  the  Maxwell-Boltzmann  distribution  law,  can,  how- 
ever, often  be  applied  to  investigate  the  action  of  such  fields  on  systems  of 
molecules. 

397.  The  simplest  case  would  be  the  efifect  of  an  external  graznta- 
tional  field  in  producing  changes  in  density  with  height,  but  this  problem 
is  evidently  merely  a  special  case  of  the  general  problem  of  determining 
the  relation  between  pressure  and  density,  and  does  not  need  separate 
treatment.  The  direct  treatment  of  the  problem  for  a  perfect  gas  is, 
however,  perhaps  so  simple  as  to  prove  instructive. 

Consider  a  tube  of  gas  parallel  to  a  uniform  gravitational  field 
which  produces  the  acceleration  g  in  the  ^'-direction.  We  can  then  write 
the  Maxwell-Boltzmann  distribution  law  in  the  form 

Co  -I-  mgx 

dN  =  NCe  kf     dxdydzdq^ dpn  (711) 

where  ingx  is  the  only  part  of  the  energy  dependent  on  the  height  x,  and 
integrating  for  all  values  of  the  variables  except  x  we  obtain 

mgx 

dN  =  NACe — kT  dx  (712) 

where  ^  is  a  constant  resulting  from  the  integration.  This  equation 
immediately  gives,  however,  the  well-known  distribution  of  concentration 
with  height,  provided  we  determine  the  constants  so  as  to  account  for  all 

315 
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the  gas  in  the  tube.  In  the  case  of  imperfect  gases  or  Hquids,  the  energy  e 
at  a  height  x  would  be  dependent  both  on  gravity  and  concentration  and 
the  treatment  would  be  more  complicated. 

398.  The  effect  of  an  external  electric  field  ^  in  polarizing  a  gas  can 
also  be  treated  with  the  help  of  the  Maxwell-Boltzmann  distribution  law, 
and  this  has  been  done  by  Debye  -  on  the  basis  of  the  classical  theory  and  by 
Pauli  ^  and  Pauling  *  on  the  basis  of  the  quantum  theory.^ 

The  general  nature  of  the  expected  effect  will  be  evident.  The 
applied  electric  field  will,  in  the  first  place,  induce  an  electric  moment 
in  the  molecules  of  the  gas  by  deforming  them,  and,  in  the  second  place, 
will  tend  to  orient  the  permanent  dipoles,  already  present,  parallel  to 
the  field.  The  total  polarization  of  the  gas  will  be  the  sum  of  the 
polarizations  produced  in  these  two  ways,  the  first  term  being  at  least 
approximately  independent  of  the  temperature  and  the  second  term  evi- 
dently decreasing  with  the  temperature  as  the  alignment  of  the  dipoles  is 
more  and  more  disturbed. 

The  Maxwell-Boltzmann  distribution  law  can  of  course  immediately 
be  applied  to  calculate  the  relative  number  of  dipoles  oriented  in  differ- 
ent directions,  since  the  energy  e  occurring  in  the  Maxwell-Boltzmann 
expression  will  contain  a  term  dependent  on  the  angle  between  the  axis 
of  the  dipole  and  the  direction  of  the  field.  Nevertheless,  a  difficulty 
arises  until  we  are  certain  as  to  whether  the  possible  angles  are  to  be 
regarded  as  continuously  distributed  in  accordance  with  classical  con- 
siderations, or  confined  to  the  special  values  given  by  the  theory  of  spatial 
quantization.  The  problem  is  one  of  the  greatest  importance  and  can- 
not, be  solved  until  we  are  more  informed  as  to  the  effect  of  molecular 
collisions  in  disturbing  spatial  quantization. 

Unfortunately  we  cannot  here  treat  this  field  in  more  detail.  It  may 
be  remarked,  however,  that  the  interpretation  of  electric  polarization  and 
dielectric  constants  by  the  classical  method  of  Debye  gives  much  larger 
values  to  the  electric  moments  of  the  dipoles  than  does  the  quantum  theory 
of  Pauli  using  wliolc  quantum  numbers,^  and  this  in  turn  much  larger 
values  than  the  theory  of  Pauling  using  Jialf  quantum  numbers.' 

399.  The  effect  of  an  external  magnetic  field  in  causing  the  magnetic 
polarization  of  a  paramagnetic  substance  can  also  be  similarly  undertaken 

^For  the  most  complete  report  on  the  effects  of  electric  and  magnetic  fields, 
see  Debye  in  Marx  "Handbuch  der  Radioligie,"  Volume  V,  page  597,  Leipzig,  1925 

=* Debye,  Phvsikal.  Zcitiiug..  13,  97  (1912). 

'  Pauli,  Zcitschr.  fiir  Phvs.,  6.  319  (1921). 

*  Pauling,  Phys.  Rev.,  27,  568  (1926). 

°  Since  the  above  was  written,  there  has  appeared  a  note  on  this  problem,  using 
the  new  dynamics  of  Heisenberg  and  Born,  by  Kronig,  Proc.  Nat.  Acad.,  12,  488 
(1926). 

"  See  Zahn,  Phys.  Rev.,  24,  400  (1924). 

'  See  Pauling,  loc.  cit. 
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with  the  help  of  the  Maxwell-Boltzmann  distribution  law  as  first  shown 
by  Langevin  ^  in  accordance  with  the  classical  point  of  view,  and  as 
treated  by  Pauli,^  Epstein  ^^  and  Gerlach  ^^  using  the  quantum  theory. 
The  results  on  the  whole  seem  to  favor  the  application  of  spatial  quantiza- 
tion, as  it  gives  magnetic  moments  corresponding  to  the  Bohr  magneton, 

400.  For  a  time  some  light  seemed  to  be  thrown  on  the  problem  of 
spatial  quantization  in  the  presence  of  collisions  by  the  work  of  Glaser/' 
which  indicated  that  the  susceptibility  of  diamagnetic  gases  increases  at 
low  pressures,  when  the  time  between  collisions  becomes  of  the  order 
of  the  time  of  the  Larmor  precession.  The  later  work  of  Lehrer,^^  how- 
ever, as  well  as  work  of  Hamar  in  the  Norman  Bridge  Laboratory,  indi- 
cate that  Glaser's  findings  were  not  justified.  At  the  present  time  it  seems 
as  if  spatial  quantization  should  be  applied  at  ordinary  pressures,  both  in 
the  case  of  electrical  and  magnetic  fields. 

401.  The  Interionic  Attraction  Theory. — In  recent  years  the  Max- 
well-Boltzmann distribution  law  has  been  used  in  a  very  interesting 
manner  to  investigate  the  distribution  of  the  charged  ions  in  an  electro- 
lytic solution.  The  theory  has  been  specially  developed  by  Milner  ^'^  and 
Debye,^'  and  many  experimental  tests  of  the  conclusions  have  been  pre- 
sented by  Noyes  ^"^  and  others. 

The  general  nature  of  the  expected  efifect  will  be  evident.  Since  the 
electric  field  of  force  around  an  ion  falls  ofif  only  with  the  square  of 
the  distance,  it  is  not  possible  as  in  so  many  applications  of  statistical 
mechanics  to  neutral  molecules,  to  assume  that  charged  molecules  have 
no  effect  on  each  other  except  in  the  short  range  of  action  that  we  ascribe 
to  ordinary  collision.  Rather  we  must  believe  that  each  positive  ion  in 
a  solution  tends  to  concentrate  negative  ions  in  its  vicinity  and  vice  versa. 
This  change  from  uniform  distribution  is  undoubtedly  one  of  the  fac- 
tors which  prevents  ionic  substances  from  acting  as  perfect  solutes,  and 
hence  its  consideration  is  of  interest  in  view  of  the  historical  importance 
of  the  theory  of  electrolytic  solutions. 

A  satisfactory  calculation  of  the  distribution  of  ions  in  an  electrolytic 

'Langevin,  Jotum.  de  Phys.,  4,  678  (1905)  ;  Am.  dc  Chim.  et  Phys.,  5,  70  (1905). 

"Pauli,  Phys.  Zeitschr.,  21,  615  (1920). 

'"Epstein,  Science,  57,  532  (1923). 

"Gerlach,  Phys.  Zeitschr.,  24,  275  (1923). 

"Glaser,  Ann.  der.  Ph\s.,  75,  459  (1924)  ;  78,  641  (1925). 

"Lehrer,  Zeitschr.  fiir  Phys.,  37,  155  (1926).  Hammar,  Proc.  Nat.  Acad.,  12, 
594  (1926). 

"Milner,  Phil.  Mag.,  23,  551   (1912)  ;  225,  742  (1913). 

''Debye  and  Hiickel,  Physik.  Zeitschr.,  24,  185  (1923);  Debye,  Ibid.,  24,  334 
(1923). 

'"Noyes,  Journ.  Amcr.  Cliem.  Soc,  46,  1080  (1924),  and  a  series  of  following 
articles  the  latest  being  by  Sherrill  and  Noyes,  Ibid.,  48,  1861  (1926). 
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solution  is,  however,  not  an  easy  matter.  If  we  had  only  a  single  posi- 
tive ion  and  a  single  negative  ion  with  charges  ±  e,  it  is  evident,  from 
the  Maxwell-Boltzmann  distribution  law,  that  the  chance  of  there  lying 
a  distance   between   r  and   r  -\-  dr   apart,    could   be   put   proportional   to 

Anr^e  rkrdr.  In  the  case  of  a  solution  of  moderate  concentration,  how- 
ever, the  tendency  of  ions  of  one  sign  to  cluster  around  an  ion  of  the 
other  sign  greatly  changes  the  effective  field  of  the  latter  and  the  calcula- 
tion of  the  final  distribution  has  been  possible  only  with  the  help  of 
approximations.  The  seriousness  of  these  approximations  has  recently 
been  emphasized  by  Bjerrum."  Furthermore,  it  has  been  pointed  out 
by  Zwicky  ^®  that  the  most  probable  distribution  of  ions  will  be  greatly 
affected,  even  at  extremely  low  concentrations,  by  the  potential  energy 
of  the  solvent,  which  is  subjected  to  an  electrostatic  compression  which 
varies  with  the  distribution  of  the  charged  ions.  It  seems  possible  that  a 
completely  satisfactory  form  of  the  interionic  attraction  theory  has  not 
yet  been  given.  Further  theoretical  work  on  the  problem  will  be  of 
considerable  interest. 

402.  Fluctuations. — The  whole  application  of  statistical  mechanics 
to  physical-chemical  systems  is  based  on  the  assumption  of  the  atomic 
structure  of  matter.  Such  a  discrete  structure  of  matter,  however, 
ever,  immediately  entails  fluctuations  in  the  condition  of  a  system  even 
when  it  is  in  equilibrium  and  fluctuations  in  rate  when  a  macroscopic 
physical  chemical  change  is  under  way.^^ 

When  the  elementary  molecular  processes  happen  to  be  of  such  a 
nature  as  to  permit  observation,  both  these  kinds  of  fluctuations  can  be 
directly  observed.  Thus  the  particles  of  a  colloidal  suspension,  which 
can  be  seen  in  the  microscope,  may  be  regarded  as  individual  molecules, 
and  their  Brownian  fluctuations  in  position  and  velocity  directly  observed, 
with  the  well-known  result  that  the  individual  particles  are  found  under 
equilibrium  conditions  to  have  the  average  kinetic  energy  ^kT  per  degree 
of  freedom,  as  theoretically  predicted  by  statistical  mechanics.  Fluctua- 
tions in  the  rate  of  radioactive  decay  can  also  be  directly  followed  by 
observing  the  scintillations  of  the  alpha  particles  which  are  ejected. 

Even,  however,  when  the  elementary  molecular  processes  cannot  be 
directly  observed,  the  existence  of  both  kinds  of  fluctuations  may  be 
apparent.  Thus,  as  first  appreciated  by  Smoluchowski,^"  the  opalescence 
of  a  substance  in  the  neighborhood  of   its  critical  point,  and   of   liquid 

"Bjerrum,  Dct  Kgl.  Dauskc   J'idcns.  Sclskab.  Math.  fys.  Med.,  VII,  9   (1926). 
"Zwicky,  Physik.  Zcitschr.,  27,  271    (1926). 

'"  For  a  review  of  the  general   field  of   fluctuations,  see   Fiirth,   "Schwankunger- 
scheinungen  in  der  Physik,"  Sammlung  Veiweg,  Braunschweig,   (1920). 
'"  Smoluchowski,  Ann.  der  Phys.,  25,  205   (1908). 
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mixtures  in  the  neighborhood  of  the  conditions  for  complete  mutual 
solubility,  are  due  to  fluctuations  from  a  state  of  uniform  density  or 
concentration.  And  as  first  studied  by  Schottky,-^  fluctuations  exist  in 
the  rate  at  which  an  electric  current  flows  from  a  hot  filament,  which  on 
analysis  are  shown  to  be  due  to  the  fact  that  the  current  results  from 
the  emission  of  individual  electrons. 

403.  For  the  case  of  the  macroscopic  fluctuations  of  a  system  which 
is  in  a  state  of  equilibrium,  a  very  simple  and  beautiful  theory  has  been 
developed  by  Einstein,--  which  we  must  briefly  describe. 

Consider  the  fluctuations  of  some  observable  property  of  a  system 
which  is  in  a  state  of  equilibrium.  Let  Xq  be  the  value  of  the  quantity 
measuring  this  property  when  the  system  is  in  its  state  of  maximum 
probability,  and  let  us  take 

dlV  =  Adx  (713) 

as  the  probability  that  the  quantity  will  be  in  the  range  between  x  and 
X  -\-  dx,  in  the  course  of  its  fluctuations. 

The  quantity  A  in  this  equation  will  evidently  vary  with  the  value  of  x, 
becoming  rapidly  smaller  the  more  x  differs  from  its  most  probable  value 
Xq.  The  main  factor  determining  A  can  be  determined,  however,  from 
the  Boltzmann  relation  connecting  probability  and  entropy 

S  =  k\ogW  -\-a  (714) 

In  accordance  with  this  equation,  we  may  write  for  the  state  of  maximum 
probability 

S,  =  k\ogWo^-a  (715) 

where  ^o  and  Wo  are  the  entropy  and  probability  of  this  state.  Or  since 

the  probability  W  will  be  of  the  order  of  unity  for  the  most  probable 
state,  we  may  write 

So^a  (716) 

Furthermore  for  the  state  of  interest  we  can  write 

S  =  k\og{dW)  -\-a  {7X7) 

and  combining  (716)  and  (717),  obtain 

dW^e~'^ir~  (718) 

We  thus  see,  since  k  is  very  small,  compared  with  appreciable  differences 
between  the  maximum  entropy  Sq  and  the  entropy  S  of  the  state  of 
interest,  that  the   main   reason,  why  the  cjuantity  A   in  equation    (713) 

"Schottky,  Ann.  dcr  Phys.,  57.  514  (1918)  ;  68,  157  (1922). 
"=■  Einstein,  Ann.  der  Phys.,  33,  1275  (1910). 


320  STATISTICAL  MECHANICS  [Ch.  24 

decreases  as  x  fluctuates  away  from  the  most  probable  value  jTq  is  due 
to  the  increasing  value  of  the  difiference  Sq  —  S.  Hence  if  we  rewrite 
equation  (713)  in  the  form 

dW  =  Be~~^l^  dx  .  (719) 

It  is  evident  that  the  dei3endence  of  B  on  .r  will  be  much  less  sharp  than 
that  of  A. 

We  may  now  transform  equation  (719)  to  advantage  with  the  help 
of  the  thermodynamic  relations  between  free  energy,  entropy  and  tem- 
perature 

and  (720) 

F  =  E  —  TS 

And  since  the  energy  of  the  system  is  the  same  in  the  two  states  and 
the  temperature  is  not  appreciably  affected  by  the  fluctuation,  we  may 
substitute 

S,-S=^^^  (721) 

in  equation  (719)  and  write 

dW  =  Bc~  ^Tf^  dx  ( 722 ) 

Since  now  we  are  interested  in  states  of  fluctuation  in  the  neighbor- 
hood of  the  most  probable  value  x  =  Xo,  we  may  develop  B  and  F  —  Fq 
as  power  series  of  the  form 

B=Bo  +  B,(x  —  Xo)  +  ....  (723) 

(F  —  Fo)=a(x  —  Xo)  +1  (x  —  Xor  +  ....  (724) 

Owing,  however,  to  the  very  large  factor  1/k  in  the  exponent  of  equation 
(722),  and  the  fact  that  we  saw  above  that  B  does  not  vary  sharply 
with  X,  we  may  neglect,  in  the  neighborhood  of  x  =  Xo,  all  but  the  first 
term  of  (723).  Furthermore  in  (724)  the  first  term  is  zero  owing  to 
the  fact  that  we  are  in  the  neighborhood  of  minimum  free  energy,  and 
higher  terms  than  the  second  may  be  neglected  owing  to  the  small  value 
of  (x  —  Xo).     So  that  we  may  rewrite  equation  (722)  in  the  form 

dW  =  Boe        FT^  dx  =  BoC~l  '  kr"     dx  (725 ) 

and  by  the  usual  process  of  averaging  obtain  for  the  mean  fluctuation  of 
the  free  energy  as  dependent  on  any  quantity  x 
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(i^_Fo)av.=  [I  {x-x,Y^^^^  =^  (726) 

Since  (F  —  Fo)  is  merely  the  work  that  has  to  be  done  in  order  to 
disturb  the  system  from  its  most  probable  state  to  the  state  of  interest, 
equations  (725)  and  (726)  can  easily  be  applied. 

Thus,  for  example,  equation  (726)  gives  at  once,  that  the  mean  com- 
ponent in  any  direction  of  the  kinetic  energy  of  a  Brownian  particle  will 
be  kT/2,  and  its  mean  height  above  the  floor  of  a  vessel  will  be  given 
by  the  relation  mgh^  kT/2,  while  equation  (725)  correctly  gives  the 
probability  of  different  kinetic  energies  and  heights. ^^ 

404.  As  another  example  we  may  calculate  the  fluctuation  in  the 
density  of  a  fluid.  If  z/q  is  the  mean  specific  volume  of  the  fluid  and  v  is 
the  specific  volume  which  occurs  for  a  mass  nt  located  somewhere  within 
the  fluid,  as  a  result  of  fluctuations,  we  may  write 

F  —  F,=^~mC{p  —  po)dv  (727) 

Expanding  p,  however,  in  the  form  of  a  series,  we  may  write 

^  ^  ^°  +  1^  ^^-  ^°)  +  ^0  (^-  ^^o)^  +  .  . .  (728) 

and  neglecting  the  second  order  and  higher  terms  obtain  by  substitution  in 
{727) 

Equation  (729)  may  now  be  substituted  in  (725)  and  (726)  and  the 
probability  for  a  given  fluctuation  in  the  specific  volume  and  the  mean 
fluctuation  in  the  specific  volume  determined.  Thus  for  the  latter  we 
obtain 

»»  [si,)  ^'-^ 

We  see  that  the  fluctuations  are  smaller  the  greater  the  mass  of  fluid  m 
considered. 

For  the  case  of  a  perfect  gas,  we  have 

RT 

""See  Fiirth,  Ann,  der  Fhys.,  53,  208  (1916), 
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where  M  is  the  molecular  weight,  and  by  substituting  in  (730),  can  obtain 

¥=j^  (731) 

where  A^"  is  the  number  of  molecules  in  the  region  considered.     It  is  thus 

evident,  that  the  fluctuations  in  the  density  of  a  given  region,  inside  a 

volume  of  gas,  would  be  too  small  to  detect,  but  if  actually  measurable 

would  furnish  a  means  of  determining  Avogadro's  number. 

In  case  of  a  fluid  in  the  neighborhood  of  its  critical  point,  however, 

fit)  B^b 

the   quantities    -^    and    -~   approach    zero,    so    that    the    fluctuations 

become  measurable  and  as  studied  by  opalescence  measurements  are  shown 
to  be  quantitatively  in  agreement  with  the  theory,"*  thus  furnishing  an 
approximate  method  of  determining  Avogadro's  number. 

405.  Conclusion. — It  is  hoped  that  the  foregoing  discussion  of 
statistical  mechanics  and  its  applications  will  be  sufficient  to  assist  in  the 
further  development  of  this  important  science. 

It  appears  certain  that  chemists,  in  particular,  since  they  have  to  deal 
for  the  most  part  with  the  behavior  of  systems  containing  many,  many 
molecules,  will  need  to  make  increasing  use  of  this  science.  For  the 
treatment  of  chemical  equilibria,  it  will  be  most  satisfactory  for  them  to 
continue  the  use  of  the  simple  language  of  thermodynamics,  substituting 
into  its  equations,  at  will,  either  empirical  values  of  the  parameters  or 
theoretical  values  obtained  by  the  methods  of  statistical  mechanics.  In 
other  fields,  however,  they  will  have  to  depend  more  and  more  directly 
on  statistical  mechanical  considerations. 

For  the  physicists,  with  their  happy  facility  in  devising  experiments 
whose  result  depends  on  the  immediate  action  of  single  atoms,  the  science 
of  statistical  mechanics  is  of  less  vital  interest.  Nevertheless,  the  inter- 
change between  the  atomic  and  macroscopic  points  of  view  will  continue 
to  be  stimulating.  With  the  help  of  hypotheses  as  to  individual  atomic 
processes  we  predict  the  behavior  of  systems  containing  many  atoms, 
and  by  observing  this  behavior  we  verify  or  revise  our  original  starting 
point.  Thus  the  Schottky  effect  may  be  predicted  as  a  result  of  our 
knowledge  of  the  atomic  subdivision  of  electricity,  or  may  in  turn  be  used 
for  determining  the  charge  of  the  individual  electrons.  Atomic  hypothesis 
can  lead  to  macroscopic  predictions,  and  macroscopic  experiments  back 
to  atomic  knowledge. 

In  the  field  of  quantum  phenomena,  we  must  expect  in  the  near  future 
many  changes  from  the  methods  of  presentation  used  in  this  book.  To 
some  extent  these  will  consist  merely  in  matters  of  detail,  such  as  changes 

"Keesom,  Ann.  der  Phys.,  35,  591  (1911). 
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in  the  theoretical  methods  of  predicting  quantum  levels.  When  the  new 
quantum  dynamics  is  fully  developed,  however,  it  may  be  desirable  to 
present  the  foundations  of  statistical  mechanics  from  a  new  point  of 
view,  basing  the  science  from  the  very  start  on  the  new  dynamics,  instead 
of  first  developing  the  classical  statistical  mechanics  and  then  introducing 
modifications  to  take  account  of  quantum  phenomena. 

Among  the  immediate  developments  that  we  can  wish  for  statistical 
mechanics,  will  be  its  increasing  application  to  systems,  in  which  the 
closeness  of  molecules  or  the  electrical  charges  that  they  carry  prevents 
the  use  of  the  common  simplifying  assumptions,  which  neglect  all  but 
discontinuous  interactions.  The  work  of  Milner  and  Debye  on  interionic 
attraction  is  a  promising  development  of  this  kind.  If  some  general  and 
simple  method  for  the  rigorous  treatment  of  such  problems  could  be  dis- 
covered it  would  serve  as  a  great  step  forward. 

Further  applications  to  the  field  of  chemical  kinetics  are  also  to  be 
desired.  The  problem  of  reaction  velocities  is  probably  nearer  to  the 
heart  of  most  chemists  than  anything  else  in  their  whole  range  of  activity. 
Rates  of  reaction  are  the  factors  that  determine  yields,  and  costs,  and 
possibilities,  and  their  theory  must  eventually  succumb  to  scientific  treat- 
ment. Some  of  the  possibilities  of  attacking  the  problem  of  reaction 
velocity  have  been  presented  in  Chapters  20,  21,  and  22  of  this  book,  but 
the  field  still  remains  one  of  the  most  obscure  in  chemistry.  Perhaps 
new  principles  as  to  the  behavior  of  individual  atoms  will  be  necessary 
to  the  final  solution. 

Whatever  may  be,  the  way  is  long  and  the  traveller  must  not  tarry. 
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APPENDIX  II 
Some  Useful  Constants 

c  =  2.9986  X  10^°  cm.  sec.-^ 

e  =  4.774  X  10""  electrostatic  units. 

h  =  6.554  X  10-"  erg  sec. 

k  =  1.372  X  10"^^  ei*g  deg."^  per  molecule. 
N  =  6.061  X  10^^  molecules  per  mol. 
i?  =  8.315  X  10'  erg  deg."^  per  mol. 
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APPENDIX  III 

Molecular  Diameters   Calculated   from   Viscosity  and 
Van  der  Waals,  b. 

0  X  108  o  X  108 

Gas  (viscosity)        (Van  der  Waals) 

Argon  2.84  2.85 

Krypton 3.12  3.14 

Xenon     3.47  3.42 

Helium   1.89  1.96 

Oxygen    2.93  2.89 

Hydrogen     2.36  [2.52] 

(3,54 
Nitrogen     3.10  /3  08 

Air    3.08  3^30 

r-    u      T^-     -A  13.27  J3.40 

Carbon  Dioxide    |3  2o  {3.20 

(Taken  from  S.  Chapman,  Phil.  Trails.  Roy.  Soc,  A  216,  347  (1916). 
The  calculations  are  based  on  a  theory  of  viscosity  which  takes  account 
of  the  deviations  from  the  Maxwellian  distribution.) 
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